701 Pennsylvania Avenue, N.W.
Washington D.C. 20004-2696 
Telephone 202-508-5000 



EDISON ELECTRIC 
INSTITUTE 

6 March 1996 


TO: Science and Technology Assessment Committee 

FROM: John Kinsman, Edison Electric Institute 

SUBJECT: February Meeting Minutes; March Meeting 


Attached are the draft minutes from the STAC's February 15, 1996 meeting. Also 
attached is an agenda and directions for the March 14 meeting to be held at the National 
Rural Electric Cooperative Association in Arlington, Virginia. 

I can be reached at telephone number 202/508-5711 if you have any questions. 

Attachments 

cc: John Novak, Bill Fang 
Global Climate Coalition - Science and Technology Assessment Committee
Agenda - March 14, 1996 Meeting


The meeting will start at 11:00 a.m. on March 14, 1996 at the offices of the National Rural Electric
Co-op Association (NRECA). Directions to NRECA are:


Address: 4301 Wilson Boulevard 

Arlington, Virginia 

11th Floor - General Managers Conference Room 
Ask Receptionist on 11th Floor 

Subway: Orange Line to Ballston Exit (Virginia) 

Escalator to street (corner of Fairfax and Stewart) 

There is a large vacant lot across from subway. Go North one block 
to Taylor, and left to Wilson. NRECA is the 3rd high-rise on your 
right, just before Wilson. 


From Route 66 outside the Beltway:

Take Glebe Road/Fairfax Drive exit. Stay on Fairfax Drive. (Glebe
is second light.) One block past Glebe, turn right onto Taylor and go
to 3rd building. Parking in basement.

Coming on Route 66 from Washington: 

Exit Glebe Road. Ramp has light. Go left on Glebe. Third light 
should be Fairfax Drive. Turn left - go one block and turn right on 
Taylor. Third high-rise on right. Parking in basement.


Agenda

1. Minutes of February 15, 1996 Meeting - Kinsman

2. GCC Report - Shlaes/Holdsworth 

3. Discussion of IPCC at SBSTA and AGBM - Bernstein, et al. 

4. Other significant events at SBSTA and AGBM - Bernstein, et al.

5. Update on Communications Committee Activities on Health Effects - Womeldorff

6. Follow-up to Science Primer - Bernstein 

7. Next Meetings - April 11 Conference Call 

May 16 at? 

8. Other Business 

The meeting will adjourn by 3:30 p.m. 
Draft Minutes


Global Climate Coalition (GCC) 

Science & Technology Assessment Committee (STAC) 
Edison Electric Institute, Washington, D.C. 
February 15, 1996


The meeting was called to order by co-chairs Porter Womeldorff and Lenny Bernstein 
at 12:30 p.m. followed by introductions. EEI host John Kinsman served as secretary. 

Attachment A is the agenda and Attachment B is the list of attendees. 

Minutes from January Meeting 

The following changes need to be made to the draft January 18, 1996 meeting 
minutes: 1) amend point 7 to minutes as follows: "approve the use by the GCC of a 
paper..."; and 2) point 5 -- spelling of Krauthammer in question. 

GCC Report 

Eric Holdsworth distributed a memo and diskette providing summaries from Bronson 
Gardner of materials on IPCC (Attachment C is the memo; contact Eric at the GCC for 
the diskette). 

Eric noted that there is a more concentrated focus on the environment by the Administration.
Eileen Claussen says the U.S. needs an insurance policy approach, going beyond a 
no-regrets approach. Tim Wirth says that there will be boosted interest in climate 
change due to extreme events and the IPCC findings. The U.S. should adopt 
preventive measures per Cold War. 

Eric noted that Argonne National Laboratory is asking for sector-specific actions that 
can be implemented, with a letter being distributed to industry. 

Russell Jones reported on a GCC meeting with Steve Seidell, of CEQ, who’s in charge 
of the Administration’s Biennial Review of the Climate Change Action Plan (CCAP). 

The GCC is trying to find out exactly what economic analyses are going forward by the
Administration, some of which will be ready in a few months. The Administration will
be selective in what data it releases. Seidell wouldn’t tell exactly what analyses are 
being done or by whom. To date, the White House has been very unhappy with 
studies of costs and jobs (i.e., the results have not been to their liking). 


AIAM-050998 



Porter reported that the still-unreleased CCAP biennial review was promised for 
release last Fall. It has not been determined whether there will be public review. 

Seidell says there will be no surprises and that there will be some individual programs 
dropped, reduced or changed. The gap does not look good. Porter says the gap is 
at 70-100 MMTCE. There will be an internal draft in 6 weeks, followed by 6 more 
weeks of interagency review. Claussen has telegraphed that the Plan will include
non-voluntary programs.

Eric Petersen is in charge of a DOE report on its voluntary programs. No similar EPA 
report is known of. 

1605 data base access is not yet available. There is some thought that limited 
reporting into the data base may cast the program in poor light. 

State Department Briefing 

Lenny Bernstein, STAC co-chair, reported on this briefing, with participants including 
DOS’s Mark Hambley, Jonathan Pershing and Dan Reifsnyder plus 50 industry 
representatives. The Administration supports IPCC science. Hambley made five 
points: GHG concentration up due to man’s activities; discernible human influence on 
climate; 0.8-3.5 degree C by 2100; cost-effective measures exist for developed and 
developing nations to reduce emissions; and justification exists for going beyond
no-regrets.

Porter noted a shift in the Administration to make GCC a significant election issue and 
that the Administration will become aggressive. 

Hambley noted that the IPCC will undertake three types of activities: the IPCC Third 
Assessment Report (circa 2000), special reports, and technical reports (quicker review 
procedure). He invited industry suggestions for subjects for the latter by February 20, 
1996. Does the STAC wish to recommend to the GCC Operating Committee any 
position regarding the future workings of the IPCC, particularly regarding topics for 
reports (it is believed that the current list includes: GHG inventory methodologies, 
technology assessment, more evaluation of scenarios, models, and mitigation 
options)? 

Porter suggests the need for a paper on accounting and reporting of emissions of 
greenhouse gases other than CO2 as one of the first papers. The STAC supports this
recommendation which Porter will report at the February 16, 1996 Operating 
Committee meeting. In addition, Chuck Hakkarinen suggested that the difference 
between simple models (e.g., the model of Wigley et al. in Nature) and complicated 
models has not been investigated. Chuck will produce a short description of this issue 
which Porter will report to the Operating Committee. 
The IPCC technical reports will be able to be produced in less than one year; the
review process has been simplified and industry will have a single opportunity for 
input. 

Three workshops will be held in concert with the upcoming SBSTA/SBI/AGBM
meetings: 1) quantified emissions limitations and reductions objectives (QUELRO) --
models, insights, timing, rates, etc.; 2) policies and measures, with U.S. on CH4,
Canada on SF6 and Canada on land use; and 3) NGO input. The last workshop will
include breakout sessions on industry, municipalities and environmental NGOs. Each
of the three NGO groups is to write its own report, and the Secretariat its own report.
This leads to a mechanism for NGO input at COP-2 in July. SBSTA will discuss the
results of the workshop at its July meeting and report to COP-2, also in July.

The U.S. is producing a paper on consideration of an amendment to the FCCC as 
opposed to a Protocol. A protocol would be adopted by consensus, while an 
amendment to FCCC would be adopted by 3/4 vote. Some think that if the proposed 
legal instrument is an amendment it would die because the G77 would not want the 
FCCC opened up to the possibility of more requirements for developing countries. 

AGBM-5 may be delayed to November or December due to the Secretariat’s move 
from Geneva to Bonn. 

Follow-up on Science Primer 

The STAC approved the draft for presentation to the Operating Committee on 
February 16, 1996. Additional comments can be submitted through February 24 to 
Lenny Bernstein. If approved by the Operating Committee, the report could be used 
in Geneva. Thanks were extended to Lenny for his significant work in leading this 
effort. 

Lenny noted that the "alternative views" section from a previous version of this
document deserves to be resurrected as a separate and rewritten document. This 
should be discussed at the March meeting. 

GCC Position Paper on Funding of Climate Research 

Russell Jones reported on a "Draft GCC Statement in Support of Global Climate 
Scientific and Economic Research" (Attachment D) by a small working group of STAC 
and Economics Committee. The STAC found the statement acceptable after one 
comment from Chuck Hakkarinen was addressed. Jones will present the statement to 
the Operating Committee on February 16, 1996. 
GCC Follow-up to IPCC Report


Porter referenced again that Bronson Gardner has done a fair amount of work 
summarizing the IPCC reports available on diskette from the Coalition (Attachment C). 

Health Issues 

Lenny Bernstein introduced Dr. Barry Friedlander of Exxon, who discussed global 
climate change and human health. Exxon is beginning to put together a white paper 
regarding what’s known about the relationship between climate change and risks to 
human health, to address: diseases affected by temperature, including strengths, 
weaknesses and data gaps; and key players and universities in health science arena 
world-wide. Exxon may attend key meetings. Exxon will evaluate models 
characterizing exposure. He hopes to have a finished white paper in May 1996 and to 
share it with the STAC. Porter noted that whatever can be shared with the STAC 
would be extremely helpful. 

Porter asked for a bibliography, especially of the key writings on the topic. 

Lenny Bernstein noted that Mobil and Chrysler have volunteered medical experts, the 
names of which Lenny will provide to Barry. Lenny added that the GCC is looking for 
an expert with acceptable credentials to serve as an outside third-party spokesman for
the GCC, asking Barry for any recommendations. Barry feels that there are good 
groups to work with (e.g., some of the folks at Johns Hopkins). 

Porter stated that it is expected that the Administration is going to push publicly the 
likelihood of vector-borne disease increases with small changes in temperature and 
other health related issues. 

Plans for Future Meetings 

The March meeting of the STAC will be held on Thursday, March 14 at the National 
Rural Electric Cooperative Association. NRECA is located one block from the Ballston 
Metro station on the orange line in Arlington, Virginia. 

If required, the April meeting will be a conference call on April 11. 

Other Business 

API has produced a document entitled "Fact or Fiction" that has been reviewed by 
Porter and a few others. Porter recommended that the document be referred to the 
STAC for review. Lenny Bernstein will collate the comments of John Holt, Eric Reiner 
and Howard Feldman. Eric Reiner has corresponded with Benjamin Santer (lead 
author of Attachment E) who has asked to rebut a paper by Hugh Elsaesser. Eric will
invite Dr. Santer to make a presentation to the STAC. Eric has also received
correspondence from Tom Wigley indicating his interest in continued discussion with 
the STAC. A further visit by Dr. Wigley should be considered after the Second 
Assessment Report is formally published. 

John Kinsman of EEI distributed a presentation on global climate change science he 
made to an EEI committee and requested comments on it from STAC members 
(Attachment F). 

Chuck Hakkarinen distributed an order form for the upcoming formal publication of the 
IPCC Second Assessment Report (Attachment G). 

Adjourn 

The meeting was adjourned at 3:15 p.m. 
Attachment A


GLOBAL CLIMATE COALITION 
SCIENCE AND TECHNOLOGY ASSESSMENT COMMITTEE 
DRAFT AGENDA for the MEETING OF FEBRUARY 15, 1996

The February 15, 1996 meeting of GCC-STAC will be held at the offices of Edison Electric 
Institute, 701 Pennsylvania Avenue, NW, starting at Noon. The draft agenda is as follows: 

1. Minutes from January, circulated by Southern Company 

2. GCC Report - Holdsworth/Shlaes 

3. State Dept. Briefing on Upcoming SBSTA, SBI, and AGBM Meetings - Bernstein, et al.

4. Workshop on NGO inputs at SBSTA Meeting - Bernstein, et al. 

5. Follow-up on Science Primer - Bernstein 

6. GCC Position Paper on Funding of Climate Research - Gehri, et al.

7. GCC Follow-up to IPCC Report - Womeldorff 

8. Health Issues - Bernstein, et al. 

9. Plans for Future Meetings - 3/14 at NRECA, 4/18 at ? 

10. Other Business 

11. Adjourn (no later than 4:00 p.m.) 
Attachment B


Attendance 

Global Climate Coalition (GCC) 

Science & Technology Assessment Committee (STAC) 
Edison Electric Institute, Washington, D.C. 
February 15, 1996


Name               Affiliation                                      Phone/Fax

Lenny Bernstein    Mobil                                            703/846-3530/-2972
Porter Womeldorff  Womeldorff Associates Ltd. (for Illinois Power)  217/422-9174 (ph & fax)
Greg Dana          Assn. of Int’l Auto Mfrs.                        703/525-7788/-8817
Howard Feldman     American Petroleum Institute                     202/682-8340/-8270
Barry Friedlander  Exxon Biomedical Sciences                        908/873-6702 phone
Chuck Hakkarinen   Electric Power Research Inst.                    415/855-2592/-1069
Roy Hamme          Duke Power                                       704/875-5935/-5493
Eric Holdsworth    Global Climate Coalition                         202/637-3164/638-1043
John Holt          National Rural Electric Cooperative Assn.        703/907-5805/-5517
Russell Jones      American Petroleum Institute                     202/682-8545/-8408
John Kinsman       Edison Electric Institute                        202/508-5711/-5150
Eric Kuhn          Cinergy                                          513/287-4061/-3499
Tom Rasmussen      National Mining Association                      202/833-9636 fax
Eric Reiner        3M for Chemical Mfrs. Assn.                      612/778-5079/-6176
Jerrel Smith       Union Electric                                   314/554-2106/-4830
Attachment E


EPRI

Electric Power
Research Institute


FAX TRANSMISSION 


DATE: February 13, 1996

Environment Group 

FROM: Chuck Hakkarinen 

Pages: 25 (including cover) 

FAX: +1 (415) 855 1069 

EPRINET: ea250ch 

Internet: chakk@mecca.epri.com 



      Name                   Fax No.         Company

To:   Porter Womeldorff      217-422-9174    Illinois Power
      Leonard S. Bernstein   703-846-2972    Mobil Corporation
      Bob Gehri              2054377-7938    Southern Company Services
      Howard Feldman         202-682-8270    API
      John Kinsman           202-508-5225    EEI
      Eric Reiner            612-778-6176    3M for CMA
      Eric Kuhn              513-287-3499    Cinergy

cc:   Donald L Rheem         202-639-8685    E. Bruce Harrison Company


Dear GCC-STAC Colleagues: 

I received this afternoon the peer-reviewed paper by Santer et al. on detection and
attribution of climate change, published in the December 1995 issue of Climate Dynamics.
Here is the copy I promised.

Dr. Chuck Hakkarinen 

Environment & Health Sciences Business Unit 


Facsimile Transmission - HAKK's FAX 


Voice: +1(415)855-2592 • Fax: +1(415)855-1069 • EPRINET: ea250ch • Internet: chakk@mecca.epri.com
Headquarters: 3412 Hillview Ave, PO Box 10412, Palo Alto, CA 94303-0813 USA • +1(415)855-2000
Climate Dynamics

© Springer-Verlag 1995

Towards the detection and attribution 
of an anthropogenic effect on climate 

Benjamin D. Santer¹, Karl E. Taylor¹,², Tom M. L. Wigley³, Joyce E. Penner², Philip D. Jones⁴
and Ulrich Cubasch⁵

¹ Program for Climate Model Diagnosis and Intercomparison, Lawrence Livermore National Laboratory, Livermore, CA 94550, USA
² Global Climate Research Division, Lawrence Livermore National Laboratory, Livermore, CA 94550, USA
³ National Center for Atmospheric Research, Boulder, CO 80307-3000, USA
⁴ Climatic Research Unit, University of East Anglia, Norwich, NR4 7TJ, UK
⁵ Deutsches Klimarechenzentrum, Bundesstrasse 55, D-20146 Hamburg, Germany

Received: 10 November 1994 / Accepted: 19 July 1995


Abstract. It has been hypothesized recently that regional-scale cooling
caused by anthropogenic sulfate aerosols may be partially obscuring a
warming signal associated with changes in greenhouse gas concentrations.
Here we use results from model experiments in which sulfate and carbon
dioxide have been varied individually and in combination in order to test
this hypothesis. We use centered [R(t)] and uncentered [C(t)] pattern
similarity statistics to compare observed time-evolving surface temperature
change patterns with the model-predicted equilibrium signal patterns. We
show that in most cases, the C(t) statistic reduces to a measure of observed
global-mean temperature changes, and is of limited use in attributing
observed climate changes to a specific causal mechanism. We therefore focus
on R(t), which is a more useful statistic for discriminating between forcing
mechanisms with different pattern signatures but similar rates of global
mean change. Our results indicate that over the last 50 years, the summer
(JJA) and fall (SON) observed patterns of near-surface temperature change
show increasing similarity to the model-simulated response to combined
sulfate aerosol/CO2 forcing. At least some of this increasing spatial
congruence occurs in areas where the real world has cooled. To assess the
significance of the most recent trends in R(t) and C(t), we use data from
multi-century control integrations performed with two different coupled
atmosphere-ocean models, which provide information on the statistical
behavior of ‘unforced’ trends in the pattern correlation statistics. For
the combined sulfate aerosol/CO2 experiment, the 50-year R(t) trends for the
JJA and SON signals are highly significant. Results are robust in that they
do not depend on the choice of control run used to estimate natural
variability noise properties. The R(t) trends for the CO2-only signal are
not significant in any season. C(t) trends for signals from both the
CO2-only and combined forcing experiments are highly significant in all
seasons and for all trend lengths (except for trends over the last 10
years), indicating large global-mean changes relative to the two natural
variability estimates used here. The caveats regarding the signals and
natural variability noise which form the basis of this study are numerous.
Nevertheless, we have provided first evidence that both the largest-scale
(global-mean) and smaller-scale (spatial anomalies about the global mean)
components of a combined CO2/anthropogenic sulfate aerosol signal are
identifiable in the observed near-surface air temperature data. If the
coupled-model noise estimates used here are realistic, we can be highly
confident that the anthropogenic signal that we have identified is
distinctly different from internally generated natural variability noise.
The fact that we have been able to detect the detailed spatial signature in
response to combined CO2 and sulfate aerosol forcing, but not in response to
CO2 forcing alone, suggests that some of the regional-scale background noise
(against which we were trying to detect a CO2-only signal) is in fact part
of the signal of a sulfate aerosol effect on climate. The large effect of
sulfate aerosols found in this study demonstrates the importance of their
inclusion in experiments designed to simulate past and future climate
change.


Correspondence to: B. D. Santer


1 Introduction 

Most previous greenhouse-gas (GHG) detection studies employing the
“fingerprint” strategy introduced by Madden and Ramanathan (1980; see also
MacCracken and Moses 1982) have used some form of pattern correspondence
statistic to compare the pattern of a model-predicted GHG signal with the
time history of observed patterns of near-surface temperature changes
(e.g., Barnett 1986; Barnett and Schlesinger 1987; Santer et al. 1993). The
signal pattern in such studies is usually taken from an equilibrium
CO2-doubling experiment, or towards the end of an experiment with a
time-dependent CO2 increase.
The strategy in this method is to search for a long-term, positive trend in
the pattern correspondence statistic, which would indicate an increasing
expression of the CO2 signal in the observations. Previous fingerprint
studies have employed either uncentered statistics, in which the
searched-for signal consists of both a pattern and the global-mean change
(e.g., Barnett and Schlesinger 1987; Hegerl et al. 1994), and/or centered
statistics, in which the global-mean change is removed and the signal is
simply a spatial anomaly pattern (Santer et al. 1993). A recent
investigation making use of an uncentered statistic reached the conclusion
that a model-predicted CO2 signal was identifiable with a high level of
confidence in the observed data (Hegerl et al. 1994). This and previous
studies have shown that the global-mean change is an important component of
a CO2 signal. A large body of earlier work has also shown that observed
changes in global-mean annually averaged near-surface air temperature are
significant relative to various statistical- and model-based estimates of
natural variability noise (e.g., Wigley et al. 1989; Wigley and Raper 1990,
1991a, b; Bloomfield and Nychka 1992; Karl et al. 1991; Stouffer et al.
1994).
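
The search for a long-term positive trend described above can be illustrated with a minimal sketch. It assumes an ordinary least-squares slope fitted to a yearly series of some pattern correspondence statistic; the function name `linear_trend` and the sample values are hypothetical and are not taken from the paper, which does not state its trend-fitting method in this excerpt.

```python
def linear_trend(values):
    """Ordinary least-squares slope of `values` against their index.

    The result is in units of the statistic per time step (e.g. per year).
    """
    n = len(values)
    if n < 2:
        raise ValueError("need at least two points to fit a trend")
    t_mean = (n - 1) / 2.0
    v_mean = sum(values) / n
    num = sum((t - t_mean) * (v - v_mean) for t, v in enumerate(values))
    den = sum((t - t_mean) ** 2 for t in range(n))
    return num / den


# Hypothetical yearly values of a pattern correspondence statistic.
stat = [0.00, 0.05, 0.02, 0.10, 0.12, 0.18, 0.15, 0.22]
slope = linear_trend(stat)
print("trend per year:", slope)
```

A positive slope alone says nothing about significance; that would require comparison against trends arising from natural variability, which this sketch does not attempt.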

Such studies have not directly addressed the issue of establishing an
unambiguous link between changes in some external forcing factor and changes
in observed climate. This is the attribution issue. It is possible that
different external forcing mechanisms (or combinations of external forcings
and natural variability) can give rise to similar changes in the global-mean
state. Since changes in the value of an uncentered statistic largely reflect
a change in the global mean, such analyses provide little help in
discriminating between different mechanisms that could have produced such a
change. (Unless the change in the global mean is so large that it cannot be
explained by non-anthropogenic forcing mechanisms and/or natural
variability.) The studies cited above suggest that we might attach high
confidence to the statement that the observed changes in global-mean
temperature over the past century are significant relative to current ‘best
estimates’ of the magnitude of natural variability noise. However, the same
investigations do not allow one to attach high confidence to the statement
that the observed changes are solely attributable to an enhanced greenhouse
effect.

Our level of confidence in attributing observed changes to anthropogenic
influences would be increased if we could demonstrate that even
sub-global-scale spatial features of a model-predicted anthropogenic signal
showed a correspondence with observed changes. This is where centered
statistics are useful since they focus on anomalies about the global mean.
The only previous study that employed a centered statistic to search
observed records of near-surface air temperature for model-predicted CO2
signals failed to show any meaningful multi-decadal positive trends in the
measure of pattern correspondence (Santer et al. 1993). This negative result
has a number of possible explanations, such as errors in the predicted CO2
signal pattern, or masking of sub-global-scale features of the CO2 signal by
low-frequency natural variability and/or other forcing factors
(anthropogenic sulfate aerosols, volcanic aerosols, solar variability,
etc.).
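
The contrast between uncentered and centered statistics can be made concrete with a small sketch. The forms used here are assumptions: an uncentered statistic written as the projection of the observed field onto the signal, sum(obs*sig)/sum(sig^2), and a centered statistic written as the correlation of spatial anomalies about each field's mean. The paper's own definitions appear in its Sect. 4, which is not part of this excerpt, and area weighting is omitted. The four-point fields are hypothetical.

```python
import math


def uncentered_stat(obs, sig):
    """Projection of obs onto sig; the spatial means are retained."""
    return sum(o * s for o, s in zip(obs, sig)) / sum(s * s for s in sig)


def centered_stat(obs, sig):
    """Correlation of spatial anomalies; the spatial means are removed."""
    n = len(obs)
    o_mean = sum(obs) / n
    s_mean = sum(sig) / n
    oa = [o - o_mean for o in obs]
    sa = [s - s_mean for s in sig]
    cov = sum(a * b for a, b in zip(oa, sa))
    return cov / math.sqrt(sum(a * a for a in oa) * sum(b * b for b in sa))


# Hypothetical model signal and an observed field whose spatial pattern is
# the opposite of the signal but whose mean is equally warm.
signal = [1.0, 2.0, 3.0, 2.0]
obs_opposite = [3.0, 2.0, 1.0, 2.0]
# The same observed pattern after a further uniform warming of 1.0.
obs_warmer = [o + 1.0 for o in obs_opposite]

for name, obs in (("opposite", obs_opposite), ("warmer", obs_warmer)):
    print(name, uncentered_stat(obs, signal), centered_stat(obs, signal))
```

In this toy case the uncentered value is positive and grows with the uniform warming even though the spatial patterns are anticorrelated, while the centered value stays at -1. That is the sense in which an uncentered statistic mostly tracks the global mean.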

Model experiments recently performed by Taylor and Penner (1994; henceforth
TP) may help to clarify whether the detection and attribution of an
anthropogenic effect on climate can be facilitated by incorporating the
climatic effects of anthropogenic sulfate aerosols. Sulfate aerosols arise
mainly from the SO2 emitted by fossil fuel combustion. Such aerosols are
likely to have caused some degree of regional-scale cooling (Wigley 1989),
both directly through clear-sky radiative forcing (reflection of incident
solar radiation) and indirectly due to changes in cloud brightness (Wigley
1989, 1991; Charlson et al. 1991, 1992; Kiehl and Briegleb 1993; Taylor and
Penner 1994; Charlson and Wigley 1994).

The TP integrations used an atmospheric general circulation model (AGCM)
coupled to a mixed-layer ocean and a tropospheric chemistry model to
investigate the climate response to the individual and combined forcing by
anthropogenic sulfate aerosols and CO2. The results from these experiments
indicate that the combination of sulfate and CO2 forcing yields a signal
pattern substantially different from the greenhouse warming pattern
typically produced by AGCMs coupled to mixed-layer oceans (e.g., Manabe and
Stouffer 1980; Hansen et al. 1984; Schlesinger and Mitchell 1987; Washington
and Meehl 1989) or fully-coupled atmosphere-ocean GCMs (CGCMs; Stouffer et
al. 1989; Cubasch et al. 1992; Meehl et al. 1993).

The TP integrations represent a first attempt to simulate the
three-dimensional climate response to anthropogenic sulfate aerosol forcing
with an AGCM coupled to a model of tropospheric sulfate chemistry. The only
comparable studies at present are those by Roeckner et al. (1995) and
Mitchell et al. (1995a, b). These investigations also considered the
temperature response to a combination of CO2 and anthropogenic sulfate
aerosol forcing, but the direct radiative effect of the aerosol was
parameterized by changing the surface albedo. Neither study allowed changes
in climate to influence the aerosol distribution, whereas in the TP study
two-way interaction is possible between climate and aerosols.

To date, only two previous attempts have been made to search observed
records of near-surface temperature changes for a GCM-predicted,
two-dimensional temperature-change pattern due to sulfate aerosols or some
combination of sulfate aerosol and CO2 forcing (Santer et al. 1995a;
Mitchell et al. 1995b). Related investigations of the possible impact of
sulfate aerosols on observed temperature data have thus far been restricted
to analysis of observed changes in areas where sulfate aerosol emissions
and/or forcing are likely to have been large (Wigley et al. 1992; Engardt
and Rodhe 1993; Hunter et al. 1993; Karl et al. 1995) or to a visual
comparison of observed temperature change patterns and patterns of
vertically-integrated aerosol concentration predicted by a
chemical-transport model (Engardt and Rodhe 1993).




In the present study we extend the pattern correlation analysis of Santer et
al. (1993) and investigate whether the near-surface temperature signals in
the TP sulfate-only, CO2-only, and combined forcing experiments are
increasingly evident in the observed data.

The structure of this paper is as follows. Section 2 provides a brief
description of the TP numerical experiments. In Sect. 3 we compare the
surface temperature responses in the three TP perturbation experiments using
maps of the geographical distribution of temperature change, pattern
correlations, and univariate t-tests. In Sect. 4, we define centered and
uncentered pattern similarity statistics and briefly consider their relative
merits in the context of detection and attribution of observed climate
change. Section 5 then employs these statistics to search the observed
record of near-surface temperature changes for the seasonal and annual
signals from the three TP perturbation experiments. The question of whether
the trends in our pattern similarity measures are significant is addressed
in Sect. 6. A short summary and conclusions are given in Sect. 7.


2 Experiments 


The experiments discussed here have been described in detail by TP. The
integrations were performed with the GRANTOUR tropospheric chemistry model
developed at Lawrence Livermore National Laboratory (Walton et al. 1983),
coupled to the Livermore version of the National Center for Atmospheric
Research Community Climate Model (NCAR CCM1; Taylor and Ghan 1992). The CCM1
AGCM in turn was coupled to a 50-meter mixed-layer ocean model with
prescribed meridional heat flux. CCM1 has 12 layers in the vertical and was
run with a horizontal resolution of ca. 4.5° latitude x 7.5° longitude. The
GRANTOUR tropospheric chemistry model is a Lagrangian trace species model,
which simulates the transport, transformation and removal of various sulfur
species (Penner et al. 1994a). Only the direct effects of sulfate aerosols
were considered. For further details of the GRANTOUR/CCM1 experimental
configuration, refer to TP.

Four integrations were performed: a control run (CTL) with nominal
pre-industrial CO2 levels (270 ppmv; the standard IPCC value is 278 ppmv,
Enting et al. 1994) and no anthropogenic sulfur emissions; a sulfate-only
experiment (S) with near-present-day anthropogenic sulfur emissions
(prescribed according to Spiro et al. 1992; Benkowitz 1982) and
near-pre-industrial CO2 concentrations; a CO2-only experiment (C) with no
sulfur emissions and nominal present-day CO2 levels (345 ppmv; cf. the 1990
value of 354 ppmv given by Enting et al. 1994); and a combined forcing
experiment with near-present-day sulfur emissions and CO2. For the purposes
of this study, each of the original TP integrations was extended by at least
10 years, and temperature-change signals were computed using samples from
the last ^.0 years of each simulation (following a spin-up period of at
least 10 years).


3 Model results

We compare the surface temperature responses in the three perturbation
experiments in various ways, using maps of the geographical distribution of
temperature change, pattern correlations, and univariate t-tests (e.g.,
Wigley and Santer 1990). For comparing patterns, we computed centered
spatial correlations, here denoted by R (see Santer et al. 1993). Univariate
t-tests were used to determine whether (and where) the grid-point means were
significantly different in the control and response experiments.
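
A grid-point comparison of this kind can be sketched as follows. The sketch assumes a Welch (unequal-variance) two-sample t statistic applied independently at each grid point and treats yearly samples as independent; the excerpt does not say which t-test variant was used or how autocorrelation was handled. The grid-point labels and sample values are hypothetical.

```python
import math
from statistics import mean, variance


def t_statistic(control, perturbed):
    """Welch two-sample t statistic for the samples at one grid point."""
    n_c, n_p = len(control), len(perturbed)
    se = math.sqrt(variance(control) / n_c + variance(perturbed) / n_p)
    return (mean(perturbed) - mean(control)) / se


# Hypothetical seasonal-mean temperatures (one value per model year)
# at two grid points, for a control run and a response experiment.
control = {"A": [0.1, -0.2, 0.0, 0.1], "B": [0.0, 0.3, -0.3, 0.1]}
perturbed = {"A": [-1.1, -0.9, -1.0, -1.2], "B": [0.1, -0.2, 0.2, 0.0]}

# One t value per grid point; a large |t| flags a mean difference that is
# unlikely to arise from sampling variability alone.
t_map = {point: t_statistic(control[point], perturbed[point]) for point in control}
print(t_map)
```

Mapping where |t| exceeds a chosen critical value gives the "where" part of the comparison; choosing that threshold requires the degrees of freedom, which this sketch leaves out.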


3.1 Sulfate-only experiment

Figures 1, 2, and 3 show the geographical distributions of seasonally and
annually averaged changes in near-surface temperature in the three TP
perturbation experiments. In the S experiment, temperature changes in DJF,
JJA and the annual average are negative at virtually all grid-points. The
maximum cooling occurs over the Norwegian Sea in DJF (ca. -7°C) and to the
east of the Weddell Sea in JJA (-6°C).

In both DJF and JJA, the location of the maximum response differs from the
location of the maximum forcing: the spatial pattern correlations between
radiative forcing and response are only 0.02 in DJF and 0.36 in JJA (cf.
Figs. 1, 2, and Fig. 4; also Table 1 and TP). The higher correlation in JJA
reflects the fact that there is some spatial congruence between forcing and
response where both are large over Western Europe. In contrast, the DJF
response maximum in the Norwegian Sea is not congruent with a maximum in the
forcing. A similar displacement between forcing and response maxima is
suggested by the results of Roeckner et al. (1995) and Mitchell et al.
(1995b).

There are substantial uncertainties associated with
the forcing and response patterns simulated in such
studies, and hence with the forcing versus response
pattern correlations presented above. In both the
Roeckner et al. (1995) and TP investigations, these
uncertainties are due in part to inadequacies in the
treatment of sea-ice and neglect of full ocean dynamics. The
latter does not apply to the study by Mitchell et al.
(1995b), which used a full CGCM and still found the
patterns of temperature response and forcing due to
the direct effect of sulfate aerosols to be dissimilar.

The results of all three investigations illustrate the
potential importance of the atmospheric general
circulation in 'modulating' the response to
regionally-localized forcing. Detection studies that use as their signal
the global sulfate aerosol distribution or aerosol
forcing predicted by a sulfur chemistry model (e.g., Wigley
et al. 1992; Engardt and Rodhe 1993), or the global
pattern of observed changes in sulfur emissions (Karl
et al. 1995) may therefore be using sub-optimal signals.
On a regional basis, however, it is likely that there is a
close relationship between emissions, forcing and
response in areas with high sulfur emissions. This would
explain Karl et al.’s (1995) finding of a significant in- 
Fig. 1a-c. Wintertime (DJF) near-surface temperature
changes (°C) in the three Taylor and Penner (1994)
response experiments, with a forcing by anthropogenic
sulfate aerosols; b CO2; and c a combination of anthropogenic
sulfate aerosols and CO2. The perturbations approximately
correspond to present-day sulfur emissions and
atmospheric CO2 levels. All changes were computed using 20-year
averages and are expressed relative to the 20-year DJF mean
of a control run with a nominal pre-industrial CO2
concentration and no anthropogenic sulfate aerosols

Fig. 2a-c. As for Fig. 1, but for summertime (JJA)
near-surface temperature changes (°C) in the three TP response
experiments

Fig. 3a-c. As for Fig. 1, but for annually averaged
near-surface temperature changes (°C) in the three TP response
experiments
Table 1. Global spatial pattern correlations between forcing and
response fields

Season  Forcing      Response
                     SO4       CO2       SO4+CO2

DJF     SO4           0.02      0.55      0.37
        CO2           0.20     -0.51     -0.03
        SO4+CO2       0.30      0.15      0.37

JJA     SO4           0.36      0.37      0.3S
        CO2           0.08     -0.52      0.04
        SO4+CO2       0.49     -0.52      0.46

ANN     SO4           0.36      0.51      0.43
        CO2           0.09     -0.56      0.01
        SO4+CO2       0.32     -0.22      0.49

The forcing fields are either the seasonally or annually averaged
radiative forcing due to the individual and combined effects of
sulfate aerosols and CO2 (see Fig. 4). The response fields are the
time-averaged near-surface temperature changes in Figs. 1-3. All
temperature changes were computed with 20-year samples
relative to the average over years 11-30 of the TP control integration
with no anthropogenic sulfate aerosols and pre-industrial
atmospheric CO2 levels. All correlations are for an area-weighted form
of R (spatial mean subtracted)


Table 2. Univariate t-test results for differences in mean state
between the TP control run with no anthropogenic sulfate aerosols
and pre-industrial atmospheric CO2 and the three TP
perturbation experiments with individual and combined CO2 and sulfate
aerosol forcing

Season  Experiment   α = 0.01   α = 0.05

DJF     SO4            86.5       93.0
        CO2            91.8       95.8
        SO4+CO2        56.1       65.3

MAM     SO4            89.4       93.7
        CO2            89.6       95.4
        SO4+CO2        55.2       65.7

JJA     SO4            88.3       93.4
        CO2            90.0       94.4
        SO4+CO2        55.5       64.5

SON     SO4            87.3       92.6
        CO2            95.8       98.3
        SO4+CO2        54.1       65.0

ANN     SO4            98.8       99.2
        CO2           100.0      100.0
        SO4+CO2        65.6       71.1

All tests were two-tailed, and use 20-year samples of seasonally
and annually averaged grid-point near-surface temperature data
from the control and perturbation experiments. Results indicate
the number of local (grid-point) rejections of the null hypothesis
of no difference in means (at stipulated significance levels of
α = 0.01, 0.05), expressed as a percentage of the total number of
tests performed


verse relationship between changes in sulfur emissions 
and changes in surface temperature in zones of high 
emissions (50°N-55°N) and at times of high insolation.
Note that the maximum Northern Hemisphere cooling
occurs in DJF, even though the maximum forcing is
in JJA and the DJF forcing is relatively small (c.f. Figs.
1a, 2a, and Fig. 4). In JJA, the maximum response is in
the Southern Hemisphere, even though the forcing is
largely in the Northern Hemisphere. There are several
explanations for these results. The fact that the
Southern Hemisphere demonstrates a substantial response
to a forcing primarily confined to the Northern
Hemisphere is an indication of the ability of the atmosphere
to exchange heat and momentum between hemispheres
in an efficient way. The generally stronger
response in the winter hemisphere is in part due to the
fact that the effect of changes in sea-ice extent on
surface temperature is largest at this time of year (when
surface inversions are possible).

A univariate t-test reveals that the changes in mean
state are highly significant in the sulfate-only
experiment (Table 2 and Fig. 5). (All t-tests were performed
using 20-year samples from the control run and the
perturbation experiments.) Of the total number of
grid-point tests performed, 86.5-98.8% showed significant
differences in temperatures between the experiment
and control (at the α = 0.01 significance level). It
is clear from these results, even without resorting to
multivariate significance tests (see Preisendorfer and
Barnett 1983; Wigley and Santer 1990), that the overall
(global) differences in means between the sulfate-only
and control experiments are highly significant.
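
The percentages quoted above come from repeating a two-sample t-test at every grid-point and counting rejections. A minimal sketch of that bookkeeping follows. It assumes a pooled-variance test, treats the 20 yearly values in each sample as independent (no allowance for autocorrelation), and uses rounded table values of Student's t for 38 degrees of freedom. The function names and the synthetic data are hypothetical.

```python
import math
from statistics import mean, variance

# Two-tailed critical values of Student's t for 38 degrees of freedom
# (two 20-year samples); rounded table values.
T_CRIT_DF38 = {0.05: 2.024, 0.01: 2.712}


def t_statistic(sample_exp, sample_ctl):
    """Pooled-variance two-sample t statistic for one grid-point."""
    n1, n2 = len(sample_exp), len(sample_ctl)
    pooled = ((n1 - 1) * variance(sample_exp)
              + (n2 - 1) * variance(sample_ctl)) / (n1 + n2 - 2)
    return (mean(sample_exp) - mean(sample_ctl)) / math.sqrt(
        pooled * (1.0 / n1 + 1.0 / n2))


def percent_rejections(exp_by_point, ctl_by_point, alpha=0.01):
    """Percentage of grid-points where the null hypothesis is rejected."""
    crit = T_CRIT_DF38[alpha]
    rejected = 0
    for sample_exp, sample_ctl in zip(exp_by_point, ctl_by_point):
        if len(sample_exp) + len(sample_ctl) - 2 != 38:
            raise ValueError("critical values assume two 20-year samples")
        if abs(t_statistic(sample_exp, sample_ctl)) > crit:
            rejected += 1
    return 100.0 * rejected / len(exp_by_point)


# Synthetic example: one grid-point cools by 1.2 degrees, one is unchanged.
control = [0.5 if year % 2 else -0.5 for year in range(20)]
cooled = [value - 1.2 for value in control]
unchanged = list(control)
print(percent_rejections([cooled, unchanged], [control, control]))
```

Each grid-point is tested separately, so the resulting percentage is a count of local rejections and not a single global (field) significance test.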


3.2 CO2-only experiment

The spatial patterns of seasonally- and annually averaged
near-surface temperature changes in the CO2-only
experiment are very similar to those obtained in
equilibrium CO2-doubling integrations with comparable
AGCM/mixed-layer ocean experimental configurations
(Manabe and Stouffer 1980; Hansen et al. 1984;
Schlesinger and Mitchell 1987; Washington and Meehl
1989). The warming is spatially coherent (Figs. 1b, 2b,
3b). The DJF and JJA temperature change patterns
show the standard picture of equator-to-pole
amplification in the winter hemisphere, a feature associated
with poleward retreat of the sea-ice margin. This
pattern similarity occurs despite the comparatively low
level of the forcing: the CO2-only experiment was
performed with an atmospheric CO2 concentration change
of only 75 ppmv (from 270 to 345 ppmv), in contrast to
the ca. 300-330 ppmv changes commonly used in
step-function CO2-doubling experiments.

The overall maximum temperature increases tend
to occur in locations where maximum cooling occurred
in the sulfate-only experiment, i.e., in the Norwegian
Sea in DJF (ca. 7°C) and in the Ross and Weddell Seas
in JJA (ca. 7-8°C). The annual average changes do not
show the same degree of hemispheric symmetry
commonly found in equilibrium doubling experiments with
mixed-layer models (Schlesinger and Mitchell 1987).
This is due to the smaller percentage reduction in
sea-ice coverage in the Northern Hemisphere: the
reduction in ice coverage in the Southern Hemisphere is
nearly three times larger (see Table 1 of TP). The
seasonal and annual warming patterns have a land-sea
contrast component, with larger changes over land
areas, as in recent transient experiments with CGCMs
(Cubasch et al. 1991; Santer et al. 1994).

As in the sulfate-only case, the response pattern
differs markedly from the pattern of the forcing, and is
negatively correlated with the latter (R = -0.51 in
DJF, R = -0.52 in JJA; see Table 1). This inverse
relationship is due to the different zonal structures of
forcing and response fields: while the response is a
maximum at high latitudes in both hemispheres, the forcing
peaks at low latitudes (c.f. Figs. 1b, 2b, 4c, 4d).

The univariate t-test results indicate that the surface
temperature signal in the CO2-only experiment is highly
significant in all seasons and in the annual average
(Table 2). Changes in the CO2-only experiment show
consistently higher percentages of grid-points with
significant differences in means than in the sulfate-only
experiment, primarily due to the larger global mean
temperature change in the former experiment
(1.45°C versus -1.19°C, respectively). It should be
noted that in TP, the climate sensitivity of the model
for a doubling of CO2 was given as 6.4°C, a result that
was based on 10-year samples only. The results we
present here are now based on the final 20 years of
much longer integrations (30-50 years). The climate
cooled by several tenths of a degree in the CO2-only
integration before stabilizing towards the end of the
integration. This yielded a much lower estimate of the
climate sensitivity: 4.8°C.


3.3 Combined sulfate/CO2 experiment

Unlike the sulfate-only and CO2-only integrations, the
patterns of temperature change in the combined
sulfate/CO2 experiment are characterized by spatially-coherent
regions of both warming and cooling (Figs. 1c,
2c, 3c). Temperature decreases are restricted largely to



Fig. 4a-d. Seasonally-averaged radiative forcing due to sulfate
aerosols and CO2 in the TP sulfate-only and CO2-only
experiments. The difference in the top-of-the-atmosphere clear-sky
outgoing SW radiation (in W m^-2) between simulations with and
without anthropogenic sulfate aerosols is shown a for winter
(DJF); and b summer (JJA). Changes in the net long-wave flux at
the tropopause due to an increase in atmospheric CO2 from 270
ppmv to 345 ppmv are also given for c winter and d summer


Fig. 5a-d. Univariate t-test results for seasonally-averaged
near-surface temperature changes in the TP response experiments.
Results are for a DJF and b JJA changes in the sulfate-only
experiment and in the experiment with combined CO2/sulfate
aerosol forcing (c, d). Shaded areas denote regions with experiment
minus CTL differences in grid-point time averages which are
significant at the 1%, 5% and 10% levels (two-tailed tests). Tests
were conducted with 20-year samples of data from the control
run and each response experiment. Near-surface temperature
changes in the CO2-only experiment (not shown) are highly
significant at virtually all grid-points


Table 3. Global and regional 'between-experiment' spatial
pattern correlations

Season  Experiment   CO2                SO4+CO2
                     full     mask      full     mask

DJF     SO4         -0.61    -0.7?      0.46     0.48
        CO2                             0.01    -0.18

MAM     SO4         -0.53    -0.31     -0.26     0.49
        CO2                             0.79     0.29

JJA     SO4         -0.59    -0.30     -0.26     0.58
        CO2                             0.79     0.32

SON     SO4         -0.3S    -0.40      0.39     0.59
        CO2                             0.52     0.20

ANN     SO4         -0.55    -0.61      0.10     0.60
        CO2                             0.63     0.01

Results are for seasonally- and annually averaged near-surface
temperature signals in the three TP perturbation experiments,
with forcing due to sulfate aerosols, CO2, and a combination of
sulfate aerosols and CO2. The perturbations approximately
correspond to present-day atmospheric CO2 levels and anthropogenic
sulfur emissions. The signals are the mean changes (computed
with 20-year samples) relative to the average over years 11-30 of
the TP control integration with no anthropogenic sulfate aerosols
and pre-industrial atmospheric CO2 levels. The first number in
each pair (non-italicized) is the (centered) pattern correlation
obtained using the full spatial fields, and the second (italicized)
is the correlation over the area defined by the observed data
mask for t0 = 1954 (see Fig. A2, panel b)


the Northern Hemisphere, which is where most of the
radiative forcing associated with sulfate aerosols occurs
(Fig. 4a, b). The largest decreases are over the
Norwegian Sea in DJF (ca. -2°C) and over southeastern
Europe in JJA (ca. -2°C). Warming maxima are in the
Ross and Weddell Seas in JJA (ca. 4-8°C), and over
Greenland, Labrador, the Sea of Okhotsk, and a small
area of Antarctica in DJF (ca. 2°C).

The warming and cooling maxima in all seasons are
considerably reduced relative to the respective maxima
in the sulfate-only and CO2-only experiments. In DJF,
for example, the large temperature changes in the
Norwegian Sea in the S and C experiments (Fig. 1a, b) are
considerably reduced in the SC integration (Fig. 1c).
While the DJF response patterns in the SC and S
integrations show some spatial correspondence (R = 0.46),
the SC and C response patterns are uncorrelated
(R = 0.01; see Table 3). In JJA, however, the SC
response pattern is very similar to the CO2-only
temperature change pattern (R = 0.79), and is negatively
correlated with the sulfate-only response pattern
(R = -0.26). This is due to the fact that the offsetting
effects of sulfate in the regions of maximum
temperature response to CO2 forcing are less in JJA than in
DJF (c.f. Figs. 1a, b, 2a, b). The pattern similarity
between C and SC and S and SC clearly depends on the
relative magnitudes of the CO2 and aerosol forcing.

The univariate t-tests (Table 2) indicate that the
fractions of the globe with significant differences in
means (SC versus CTL) are consistently lower than in
either the S or C experiments. This is due to
compensating warming and cooling responses over large areas
of the Northern Hemisphere in both seasons and in the
annual average, resulting in large areas with relatively
small changes in the mean state. Large,
spatially-coherent regions with differences in means significant at the
1% and 5% levels are generally restricted to Southern
Hemisphere oceans and low-latitude Northern
Hemisphere ocean areas (Fig. 5).

The weak response over large areas of the Northern
Hemisphere, as indicated by a lack of statistical
significance, is not an artefact of the length of the control and
response experiments. For the 'present-day' sulfate
and CO2 forcing levels stipulated in the TP SC
experiment, large areas of the Northern Hemisphere with
little or no change in the mean state are a fundamental
property of the simulated response pattern (at least for
the model used here).

It is interesting to note that the response patterns
for the SC integration are, in a purely qualitative sense,
more similar to observed patterns of seasonal
temperature change (see Jones et al. 1991; Folland et al. 1992;
Parker et al. 1994) than the response patterns in the
individual S and C experiments. We will consider this
issue further in Sect. 5.1, which presents quantitative
measures of observed versus simulated pattern
similarity.


4 Pattern similarity statistics

In this section we introduce the pattern similarity
statistics, R(t) and C(t), which we subsequently use for
comparing model and observed spatial patterns of
temperature change. The terminology is similar to that
used by Santer et al. (1993).


4.1 Definition and computation of pattern similarity
statistics

We first compute seasonal- and annual-mean temperature
changes in the TP control run and response
experiments. In each of the three TP response experiments
(S, C, and SC), the temperature-change signal is
defined as

$$ \Delta M(x) = \overline{M}_{exp}(x) - \overline{M}_{ctl}(x) \qquad (1) $$

where M denotes model data, with the subscripts exp
and ctl identifying output from one of the TP
response experiments and the control integration,
respectively. The index x is a discrete variable running
over space (grid-points), with x = 1, ..., n. The overbars
in $\overline{M}_{exp}(x)$ and $\overline{M}_{ctl}(x)$ indicate time averages, here
computed using 20-year samples of experiment and
control data. We stress that these signal patterns have
no time-dependence other than a dependence on
season, since the TP integrations are equilibrium response
experiments with no interannual changes in the
forcing. The rationale for the use of equilibrium signal
patterns (as opposed to transient signals) is discussed in
detail in Santer et al. (1995a).

The observed data, D(x, t), consist of monthly-mean,
land-based surface air temperatures and sea-surface
temperatures from the combined land-ocean data
set described by Jones et al. (1991), spanning the
interval 1854-1993. The data are in the form of anomalies
relative to the mean over 1950-79. They contain
high-frequency variability components associated with El
Niño behavior, volcanoes, etc. For the purposes of
GHG-detection studies, these components constitute
undesirable noise: data were filtered in the following
way to reduce this noise.

We first define a temporally-smoothed reference
state $\overline{D}_0(x)$ centered on the year t0 as

$$ \overline{D}_0(x) = \sum_{v=-q}^{q} D(x, t_0 + v)\, W(v) \qquad (2) $$

where W(v) are the normalized symmetric weights for
a p-term Gaussian filter centered at t0 (with p = 2q + 1;
we use p = 13 below). As an example, if t0 = 1954, the
reference climate is the filtered mean over the years
1948-1960. We then define the (smoothed) anomaly at
time t relative to $\overline{D}_0(x)$ by

$$ \Delta D(x, t) = \overline{D}(x, t) - \overline{D}_0(x) \qquad (3) $$

where

$$ \overline{D}(x, t) = \sum_{v=-q}^{q} D(x, t + v)\, W(v) \qquad (4) $$


We use t = 1910, 1911, ..., 1993, so that the filtered
anomalies cover the 84-year period 1910-1993, while the
data used in the filtering extend from 1904-1999. Filter
weights are set to zero if data are missing. Since data
are not available for 1994-1999, we assigned a missing
value code for all post-1993 data. We then stipulate
that a filtered mean can only be computed if a critical
fraction of the sum of the Gaussian filter weights
(W_crit) is exceeded at any grid-point over any p-year
period. Here W_crit = 0.6, which allows us to calculate
filtered means for 1993. Further details of the filtering
procedure are given in Santer et al. (1995a).
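
The filtering rule described above (zero weight for missing years, and no filtered value unless enough filter weight is available) can be sketched as follows. This is a minimal illustration, not the procedure of Santer et al. (1995a): the Gaussian width `sigma`, the renormalization by the available weight, the dictionary layout of the series and the function names are all assumptions, since none of them is specified in the text.

```python
import math


def gaussian_weights(p=13, sigma=1.8):
    """Normalized symmetric weights W(v), v = -q..q, with p = 2q + 1.

    sigma (in years) is an assumed width; the text does not state it.
    """
    q = (p - 1) // 2
    raw = [math.exp(-0.5 * (v / sigma) ** 2) for v in range(-q, q + 1)]
    total = sum(raw)
    return [w / total for w in raw]


def filtered_mean(series, center, weights, w_crit=0.6):
    """Filtered value at `center`, or None if too much data is missing.

    series: dict mapping year -> value at one grid-point; absent years or
    None values count as missing and receive zero weight.
    """
    q = (len(weights) - 1) // 2
    weighted_sum = 0.0
    available_weight = 0.0
    for v, w in zip(range(-q, q + 1), weights):
        value = series.get(center + v)
        if value is None:
            continue
        weighted_sum += w * value
        available_weight += w
    if available_weight <= w_crit:
        return None
    # Assumption: renormalize by the weight of the years actually present.
    return weighted_sum / available_weight


weights = gaussian_weights()
series = {year: 0.25 for year in range(1904, 1994)}  # no data after 1993
print(filtered_mean(series, 1954, weights))
print(filtered_mean(series, 1993, weights))
print(filtered_mean(series, 1999, weights))
```

With the assumed width, the seven available years 1987-1993 carry just over 60% of the total weight, so a filtered value for 1993 can still be formed even though the six later years are missing.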

The set of time-evolving monthly-mean anomaly
fields, ΔD(x, t), was then used to compute seasonal
and annual averages. In order to avoid large, spatially
non-random increases in coverage from the beginning
of the century to the present, we stipulated that the
grid-point coverage for ΔD(x, t) must be a subset of the
coverage for the reference state $\overline{D}_0(x)$ (see Appendix A).
For a given response experiment and a given season, we
now have a single pattern characterizing the model
temperature signal, ΔM(x), and a series of time-evolving
patterns characterizing observed temperature changes,
ΔD(x, t).

Following Santer et al. (1993), we use two different
types of measure to compare the spatial fields ΔM(x)
and ΔD(x, t). These are defined by

$$ R(t) = \frac{\sum_{x=1}^{n} \left( \Delta D(x,t) - \widehat{\Delta D}(t) \right) \left( \Delta M(x) - \widehat{\Delta M} \right)}{n\, s_D(t)\, s_M} \qquad (5) $$

where

$$ s_D^2(t) = \sum_{x=1}^{n} \left( \Delta D(x,t) - \widehat{\Delta D}(t) \right)^2 / (n - 1) \qquad (6) $$

is the spatial variance (with $s_M^2$ defined similarly), and
the hat (^) in Eqs. (5) and (6) indicates a spatial average,
and

$$ C(t) = \frac{\sum_{x=1}^{n} \Delta D(x,t)\, \Delta M(x)}{\sum_{x=1}^{n} \Delta M(x)^2} \qquad (7) $$

R(t) is simply a spatial anomaly correlation (similar
to statistics used in measuring the 'skill' of numerical
weather predictions; see, for example, Anderson and
van den Dool 1994) with the statistic centered about
the spatial means of the observed and simulated fields.
C(t) is the uncentered statistic originally defined and
used by Barnett and Schlesinger (1987).
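
The difference between the centered and uncentered statistics is easiest to see on a small example. The sketch below evaluates both for one fixed model pattern and a few observed patterns. It is a simplified illustration: grid-points are unweighted, no missing-data mask is applied, the same normalization is used for the covariance and variance terms in R(t) so that it is bounded by ±1.0, and all names and numbers are hypothetical.

```python
import math


def centered_r(delta_d, delta_m):
    """Centered statistic R: correlation of anomalies about spatial means."""
    n = len(delta_m)
    mean_d = sum(delta_d) / n
    mean_m = sum(delta_m) / n
    cross = sum((d - mean_d) * (m - mean_m) for d, m in zip(delta_d, delta_m))
    ss_d = sum((d - mean_d) ** 2 for d in delta_d)
    ss_m = sum((m - mean_m) ** 2 for m in delta_m)
    return cross / math.sqrt(ss_d * ss_m)


def uncentered_c(delta_d, delta_m):
    """Uncentered statistic C: projection of the data onto the signal."""
    return (sum(d * m for d, m in zip(delta_d, delta_m))
            / sum(m * m for m in delta_m))


signal = [0.2, 1.0, -0.4, 2.2]        # hypothetical fixed model pattern
observed_by_year = {
    1910: [0.0, 0.1, -0.1, 0.2],
    1950: [0.1, 0.5, -0.2, 1.1],      # half the signal amplitude
    1993: [0.7, 1.5, 0.1, 2.7],       # full signal plus a uniform 0.5
}
for year, pattern in sorted(observed_by_year.items()):
    print(year, round(centered_r(pattern, signal), 3),
          round(uncentered_c(pattern, signal), 3))
```

In this example the 1950 and 1993 patterns have the same centered correlation with the signal, while the uncentered statistic distinguishes them through their amplitude and spatial mean.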

C(t) has several desirable properties. First, it is
equal to 1.0 when ΔD(x, t) ≡ ΔM(x), so that it is a
measure of the strength of the model signal in the
observed data. Second, unlike R(t), it does not involve a
time-varying observed term in the denominator. Thus
a trend in C(t) with increasing time can only be due to
increasing similarity between the model and observed
mean-change fields in the numerator of Eq. (7). In
contrast, a trend in R(t) with time may be attributable
to a change in either the observed mean state and/or
the observed spatial variance. Third, C(t), unlike R(t),
is not bounded by ±1.0. One consequence is that as
R(t) approaches ±1.0, changes in the amplitude of the
observed pattern will not be reflected in linear trends
in R(t).

The choice of which type of statistic to use is not
clear cut. C(t) has the apparent advantage that it
includes information about the mean change field,
$\widehat{\Delta D}(t)$, which is an important part of the signal in most
cases, while R(t) focuses on the pattern of change.
We have shown previously (Santer et al. 1993, 1995a)
that C(t) can be decomposed into R(t) and $\widehat{\Delta D}(t)$
components. In situations where the model-predicted
change in global-mean temperature is large relative to
the observed changes, the R(t) component of C(t) is
much smaller than the $\widehat{\Delta D}(t)$ component, so time
series of C(t) look very similar to those of $\widehat{\Delta D}(t)$ (see
e.g., Santer et al. 1993, Fig. 7). In such cases, C(t)
cannot be used to address the attribution issue: i.e., if
different external forcing mechanisms give rise to similar
rates of global mean change, C(t) cannot be used to
determine which of the forcings caused the observed
change. It is in this situation that R(t), which focuses
on the pattern of spatial anomalies about the mean
state, may provide the information required to
discriminate between forcing mechanisms with different
pattern signatures.
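
The decomposition mentioned above can be written out directly from the definitions of R(t) and C(t). The following is a sketch of the algebra, assuming that a hat denotes a spatial average over the n grid-points and that R(t) is normalized by $n\, s_D(t)\, s_M$; it is meant as a reading aid, not as a reproduction of the derivation in Santer et al. (1993, 1995a).

```latex
\begin{align*}
\sum_{x=1}^{n} \Delta D(x,t)\,\Delta M(x)
  &= \sum_{x=1}^{n} \bigl(\Delta D(x,t) - \widehat{\Delta D}(t)\bigr)
                    \bigl(\Delta M(x) - \widehat{\Delta M}\bigr)
     + n\,\widehat{\Delta D}(t)\,\widehat{\Delta M} \\
  &= n\,s_D(t)\,s_M\,R(t) + n\,\widehat{\Delta D}(t)\,\widehat{\Delta M}, \\[4pt]
C(t)
  &= \frac{n\,s_D(t)\,s_M\,R(t) + n\,\widehat{\Delta D}(t)\,\widehat{\Delta M}}
          {\sum_{x=1}^{n} \Delta M(x)^{2}} .
\end{align*}
```

The first term carries the pattern information and the second carries the spatial means, so when the model's mean change $\widehat{\Delta M}$ is large the second term dominates the behavior of C(t).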

In the following, our emphasis is on attribution;
hence we prefer to use R(t). We also compute C(t),
since this enables us to determine the significance of
global-mean changes and to compare our results with
other investigations that have used uncentered
statistics (e.g., Hegerl et al. 1994).


5 Comparison of model and observed temperature 
change patterns 

In this section, we use both C(t) and R(t) to compare
model and observed temperature change patterns. For
each of the three TP experiments and for each season,
a single pattern characterizing the model signal is
compared with 84 observed temperature change patterns
consisting of filtered data for the years 1910, 1911, ...,
1993. The resulting C(t) and R(t) time series show
whether this fixed pattern is increasingly evident in the
observed data. Pattern correlations are calculated after
excluding grid-points with missing observed data, and
with a reference period centered on t0 = 1954.
Sensitivity to the choice of reference period is minimal, as
shown in Appendix A.


5.1 Pattern similarity results 

5.1.1 CO2-only experiment. Consider first the results
for temperature signals from the TP experiment with
present-day CO2 forcing and no anthropogenic sulfate
aerosols (Fig. 6). If CO2 forcing were the dominant
influence on climate, i.e., if other external forcings and
natural internal variability were relatively small on
timescales appropriate to a slowly-evolving greenhouse
warming signal, the R(t) and C(t) time series should
show strong multi-decadal positive trends as the signal
became progressively more pronounced in the
observed data.

C(t) shows no evidence of large, positive trends that
are sustained for 40 to 50 years or longer (Fig. 6). This
is in accord with results obtained by Santer et al.
(1993) for the surface temperature signals from CO2
doubling experiments performed with five different
AGCMs. The largest and most sustained C(t) trends
occur at the beginning of the record (between ca.
1910-1940), not at the time of most-rapidly increasing
forcing (see Wigley and Raper 1992). Changes in C(t)
parallel those in global-mean temperature over the
entire observed record (Jones et al. 1991; Jones and
Briffa 1992). Note also that in all seasons except DJF, C(t)
time series show the effects of the Pinatubo eruption in
June 1991 and the volcanic aerosol-induced reduction
in global mean temperature (Hansen et al. 1993; Jones
1994).

These similarities in the behavior of global mean
temperature and C(t) are in agreement with the
theoretical and empirical results of Santer et al. (1993,
1995a). As shown in Table 4, C(t) changes for all three
perturbation experiments correlate strongly with
$\widehat{\Delta D}(t)$, although the correlations are smallest for the
combined forcing case. This points to a dominant
influence of $\widehat{\Delta D}(t)$ on the behavior of C(t), as discussed
further in Santer et al. (1995a).

R(t) behaves quite differently from C(t) (see Fig. 6
and Table 4). In all seasons, R(t) increases during the
same 1910-40 period over which C(t) rises, but then
decreases and shows high-frequency oscillations about
some mean state, with no evidence of a large, positive
linear trend component over the last 40-50 years. The
Fig. 6. Centered R(t) and uncentered C(t) pattern correlations
between model and observed near-surface temperature changes.
Model changes in seasonally and annually averaged temperature
are taken from the TP CO2-only experiment. For each season,
one time-independent spatial pattern characterizes the
response to forcing by present-day atmospheric CO2
concentrations. Observed changes were filtered to remove high-frequency
noise, and were expressed as a time series of 84 anomaly patterns
(from 1910-1993) relative to a filtered reference period extending
from 1948-1960. All pattern correlations were calculated after
excluding grid-points with missing data

Fig. 7. As for Fig. 6, but for near-surface temperature changes
from the TP sulfate-only experiment

Fig. 8. As for Fig. 6, but for near-surface temperature changes
from the TP experiment with combined CO2 and sulfate aerosol
forcing

Fig. 9. Behavior of pattern correlation statistics in the absence of
external forcing. R(t) and C(t) time series were computed
using (filtered) time-dependent patterns of near-surface
temperature change from a 600-year control integration (HAMCTL) with
a CGCM and fixed seasonal- and annual temperature-change
patterns from the TP experiment with combined CO2/sulfate
aerosol forcing
Table 4. Correlations between C(t) and R(t) and the
changes in observed area-averaged near-surface temperature

Season  Experiment   R(t) vs. $\widehat{\Delta D}(t)$   C(t) vs. $\widehat{\Delta D}(t)$

DJF     SO4          -0.29                   -0.98
        CO2           0.15                    0.99
        SO4+CO2      -0.38                    0.93

MAM     SO4          -0.26                   -0.59
        CO2           0.10                    1.00
        SO4+CO2      -0.19                    0.96

JJA     SO4          -an                     -0.95
        CO2           0.39                    0.99
        SO4+CO2       0.06                    0.63

SON     SO4          -0.05                   -0.91
        CO2          -0.15                    0.98
        SO4+CO2       0.01                    0.82

ANN     SO4          -0.06                   -0.99
        CO2           0.13                    1.00
        SO4+CO2      -0.01                    0.91

Seasonal and annual C(t) and R(t) time series are for the three
TP perturbation experiments (see Figs. 6-8). Values of $\widehat{\Delta D}(t)$
were computed using the observed data mask for t0 = 1954 (see
Fig. A2, panel b). For C(t), the changes for all three perturbation
experiments correlate strongly with $\widehat{\Delta D}(t)$, which points to a
dominant influence of $\widehat{\Delta D}(t)$ in the behavior of C(t)


initial increase in R(t) is in accord with the results of
Wigley and Jones (1981) and Kelly et al. (1982), who
found that the pattern of observed warming in the
1930s showed some evidence of the high-latitude
amplification characteristic of the temperature response
in CO2 doubling experiments.

5.1.2 Sulfate-only experiment. The C(t) time series for
the sulfate-only temperature signals are virtually the
inverse of the C(t) time series for the TP CO2-only
experiment (c.f. Figs. 6 and 7). This is not surprising,
since we have replaced a spatially-coherent warming
pattern by a spatially-coherent cooling pattern in
computing the pattern similarity statistic, and because
(apart from the sign) C(t) is expected to parallel $\widehat{\Delta D}(t)$
in both cases.

For R(t), however, the time series are not simply
the inverse of the results for the CO2-only signals. In
JJA and SON, R(t) has a large positive linear trend
over 1940-1970, a period over which the CO2-only C(t)
and R(t) trends are small.

5.1.3 Combined forcing experiment. The R(t) time
series for the comparison of observed temperature
changes and signals from the TP combined forcing
experiment are quite different from the corresponding
time series for the CO2-only experiment (c.f. Figs. 6
and 8). This is most pronounced in JJA and SON: R(t)
now shows a large positive trend that is sustained over
the last 40-60 years of the observed record. The
positive trends in JJA and SON are similar to those for the
sulfate-only case (Fig. 7). This implies that some of the
time-increasing congruence between the combined
forcing signal pattern and observations is coming from
areas where cooling occurs, at least over 1940-1970,
possibly related to sulfate aerosol effects.

These results suggest that the inclusion of forcing by
both CO2 and anthropogenic sulfate aerosols enables
one to obtain a better fit between observed variations
in near-surface temperature and a model-predicted
temperature response pattern. This is particularly so
over the 1940-1970 period, when R(t) shows little
trend for the CO2-only signal (Fig. 6).

Some insights into the regions that contribute most
to the trends in R(t) are obtained by comparing the
linear trends in observed near-surface temperature
(e.g., over 1946-86; see Karl et al. 1995) with the TP
signal patterns from the combined forcing experiment.
For JJA, such a comparison shows that the observed
cooling over south-eastern Europe, the east coast of
the U.S., and the Tibetan/Mongolian Plateau is
congruent with the TP temperature change pattern (c.f.
our Fig. 2c with Fig. 2f in Karl et al. 1995). This must
contribute towards the large 50-year trends in R(t).

6 Significance of R(t) and C(t) trends

In the previous section we showed that for certain
seasons, the R(t) and C(t) time series for surface
temperature signals from the TP experiment with combined
CO2 and sulfate aerosol forcing show large, positive
linear trends over the last 40-60 years. But are these
trends statistically significant? This is a difficult
question to answer. In order to make meaningful
statements about trend significance, we need information
about the characteristics of 'unforced' R(t) and C(t)
trends: i.e., trends due solely to the effects of natural
variability on decadal- to century-time scales. We
cannot obtain this information easily from the observed
data in view of the difficulties involved in separating
the observed variability of surface temperatures into a
component associated with a time-evolving response to
anthropogenic influences and a component associated
with natural variability.

In the model world, however, we can examine
output from experiments with no external forcing in order
to estimate the magnitude and spatial properties of
internally-generated natural variability noise. In this
study we therefore employ model-generated noise data
to assess trend significance. We stress that the model
noise is due only to internal variability of the coupled
atmosphere-ocean system, and does not incorporate
variability related to changes in natural external
forcings (e.g., in solar output or the volcanic dust loading
of the atmosphere).

The 20-year TP control integration is clearly too 
short for estimating natural variability noise character¬ 
istics on time scales appropriate to a slowly-evolving 
anthropogenic signal. We therefore used surface temperature
results from two recent multi-centennial control
integrations performed with fully-coupled atmosphere-ocean
GCMs to determine the statistical behavior
of R(t) and C(t) trends in the absence of external
forcing. Near-surface temperature data were taken
from a 600-year control integration with the Hamburg
CGCM (Hegerl et al. 1994) (the ECHAM-1 T21 resolution
AGCM coupled to the large-scale geostrophic
OGCM), and a 1000-year integration of the GFDL
coupled model (Delworth et al. 1993; Stouffer et al.
1994) (the GFDL R15 AGCM coupled to the modular
ocean model). These experiments are referred to below
as HAMCTL and GFDLCTL, respectively.

A detailed comparison of the near-surface tempera¬ 
ture variability in HAMCTL and GFDLCTL is beyond 
the scope of this study. However, since data from these 
integrations are used in the significance testing analysis,
we provide an overview of model variability differences
in Appendix B.


6.1 Determination of trend significance 

In order to determine the significance of the R(t) and
C(t) trends, we need first to establish the sampling distributions
of 'unforced' trends in these statistics. The
method we use is similar to that employed by Santer et
al. (1995b).

We treat HAMCTL and GFDLCTL in the same 
way as the observed near-surface temperature data, 
and first define anomalies relative to some reference
state of the control run 

$\Delta N(x, t) = \bar{N}_1(x, t) - \bar{N}_0(x)$    (8)

where N denotes noise data from either HAMCTL or
GFDLCTL and the reference state $\bar{N}_0(x)$ is the
time-average over the entire control run. As in Eq. (4), data
in $\bar{N}_1(x, t)$ were smoothed with a 13-term Gaussian filter,
and the index t denotes the center year of an interval
of length p (= 13). The HAMCTL and GFDLCTL
anomalies were then interpolated to the observed data
grid, and model data points outside of the observed region
for t_0 = 1954 (see Fig. A2) were excluded from the
analysis. Unlike the situation when dealing with observed
data (see Appendix A), there are no coverage
changes as a function of time: once the (filtered) observed
data mask for t_0 = 1954 has been used to exclude
model data points from the analysis, this mask does not
change with time. We then substituted $\Delta N(x, t)$,
$\overline{\Delta N}$, etc. for the corresponding observed terms in
Eqs. (5-7), and computed pattern correlation time series in the
absence of external forcing. These are referred to subsequently
as R_N(t) and C_N(t), where the subscript N
indicates that the statistics were derived using either
HAMCTL or GFDLCTL noise data, and not observed
data.

For each pattern correlation statistic, therefore,
there is one time series for each season and the annual
average, for each of the three TP response experiments,
and for each of the two natural variability integrations.
This gives a total of 30 R_N(t) time series and
30 C_N(t) time series. Time series utilizing the
HAMCTL (GFDLCTL) data are of length 588 (988).
As an example, we show the R_N(t) and C_N(t) time series
for the TP combined forcing experiment and the
HAMCTL natural variability noise (Fig. 9). Both sets
of time series have considerable variability on 10- to
20-year time scales. For C(t), the observed changes
over the last 40 to 50 years (≈0.3-0.4 in the case of
the CO2-only signal; see Fig. 6) are noticeably larger
than C_N(t) changes (≈0.2). For R(t), however, it is
more difficult to evaluate whether the observed
changes are unusual by visual inspection alone.

We next select an array of trend lengths, L_i = i × 10
(i = 1, ..., 5), appropriate to the length of signal trends
we wish to evaluate (10-50 years). We then fit linear
trends for different L_i to non-overlapping sections of
the R_N(t) and C_N(t) time series. The HAMCTL
(GFDLCTL) sample sizes therefore range from 58 (98)
for 10-year trends to 11 (19) for 50-year trends. This
yields a distribution of slope parameters, β_N(i), i = 1,
..., 5, for each statistic, season, response experiment,
and natural variability noise integration. These distributions
are the yardsticks against which we judge the
significance of signal trends, β_S(i).
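
As a rough illustration of the chunking step described above, the following Python sketch fits least-squares trends to non-overlapping sections of a noise time series. The function name `nonoverlapping_trends` and the white-noise stand-in for an R_N(t) or C_N(t) series are assumptions made purely for illustration; they are not taken from the study.

```python
import numpy as np

def nonoverlapping_trends(series, length):
    """Least-squares slopes for consecutive, non-overlapping chunks.

    Hypothetical helper: `series` stands in for an R_N(t) or C_N(t)
    time series, `length` for one trend length L_i (in years).
    """
    series = np.asarray(series, dtype=float)
    n_chunks = series.size // length          # leftover years are dropped
    chunks = series[:n_chunks * length].reshape(n_chunks, length)
    years = np.arange(length)
    # polyfit accepts a 2-D y (one data set per column); row 0 holds the slopes
    return np.polyfit(years, chunks.T, 1)[0]

# White noise is only a placeholder for real control-run statistics.
rng = np.random.default_rng(0)
for n_years in (588, 988):                    # series lengths quoted in the text
    noise = rng.standard_normal(n_years)
    sizes = [nonoverlapping_trends(noise, L).size for L in (10, 20, 30, 40, 50)]
    print(n_years, sizes)
```

With series of length 588 and 988, the chunk counts for 10-year and 50-year trends come out as 58 (98) and 11 (19), the sample sizes quoted in the text.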

The signal trends are simply the least-squares linear
trends for the final 10-50 years of the R(t) and C(t)
time series in Figs. 6-8, i.e., the trends over 1984-93,
1974-93, etc. The R(t) time series for the SON signal
from the TP combined forcing experiment illustrates
this (see Fig. 10). While the signal trends over the final
10-20 years of this time series are negative, trends over
longer intervals are positive and become increasingly
large.

To compute the significance level (p-value) for a selected
experiment and trend length L_i, we compare
β_S(i) with the sampling distribution of β_N(i) and determine
k_i, the number of times β_N(i) ≥ β_S(i). Note
that this is a one-tailed test, since we have directional
information about the signal: the linear trends in R(t)
and C(t) should be positive. The probability of obtaining
the signal trend by chance based on the natural variability
manifested in HAMCTL or GFDLCTL is then
simply

$p_i = k_i / m_i$    (9)

where m_i is the sample size for β_N(i). Note that the
number of independent linear trend samples in
HAMCTL and GFDLCTL may differ from m_i due to
the long decorrelation time of near-surface temperature,
both in models and in the observations (see Hegerl
et al. 1994). The most satisfactory way of increasing
the number of independent samples would be by
extending the two control runs used here for thousands
of (simulated!) years. Such lengthy CGCM integrations
may become feasible in the next few years, but were
not available for the purposes of this investigation.
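
The one-tailed test of Eq. (9) can be sketched in a few lines of Python. The function name `one_tailed_p` and the small array of noise trends are hypothetical and serve only to show the counting; real sampling distributions would come from the control-run statistics.

```python
import numpy as np

def one_tailed_p(signal_trend, noise_trends):
    """p = k / m: fraction of unforced trends at least as large as the signal.

    Hypothetical helper illustrating Eq. (9); `noise_trends` plays the
    role of the sampling distribution of unforced slopes.
    """
    noise_trends = np.asarray(noise_trends, dtype=float)
    k = int(np.sum(noise_trends >= signal_trend))   # number of exceedances
    m = noise_trends.size                            # sample size
    return k / m

# Made-up slopes, for illustration only.
noise_trends = [-0.03, -0.01, 0.00, 0.01, 0.02, 0.04]
print(one_tailed_p(0.03, noise_trends))   # only one noise trend is >= 0.03
```

Because the test is one-tailed, a strongly negative signal trend yields a p-value near 1 rather than near 0.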

Fig. 10. Linear trends for the final 10-50 years of the R(t)
time series for the SON near-surface temperature signal from
the TP experiment with combined CO2/sulfate aerosol forcing
(10-year trend, 1984-93; 20-year trend, 1974-93; 30-year trend,
1964-93; 40-year trend, 1954-93; 50-year trend, 1944-93). These
are the signal trends β_S(i), whose significance we wish to
determine

For the case we are dealing with here, a relatively
short natural variability time series and relatively long
trend lengths L_i, the use of non-overlapping chunks
provides a noisy estimate of the sampling distribution
of 'unforced' 10- to 50-year linear trends. A much
smoother picture is obtained if overlapping chunks are
used (as in Wigley and Raper 1990; Santer et al.
1995b), although this does not lead to a proportionate
increase in the number of independent samples in the
distribution. We computed p-values using both non-overlapping
chunks and chunks that had the maximum
overlap (i.e., by all but one year). The p-values were
generally similar in both cases. More importantly, the
decisions on the significance of R(t) and C(t) trends
did not depend on whether sampling distributions of
β_N(i) were computed with non-overlapping or overlapping
chunks. In the following section, therefore, our
discussion is restricted to results obtained using sampling
distributions with overlapping chunks.
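
To make the maximum-overlap case concrete, here is a minimal Python sketch in which each chunk is shifted by one year relative to the previous one. The helper name `overlapping_trends` is an assumption, and `sliding_window_view` requires NumPy 1.20 or later.

```python
import numpy as np
from numpy.lib.stride_tricks import sliding_window_view

def overlapping_trends(series, length):
    """Least-squares slopes for chunks that overlap by all but one year.

    Hypothetical helper; `series` stands in for an R_N(t) or C_N(t) series.
    """
    series = np.asarray(series, dtype=float)
    windows = sliding_window_view(series, length)   # shape (n - length + 1, length)
    years = np.arange(length)
    return np.polyfit(years, windows.T, 1)[0]       # one slope per window

# Only the number of chunks matters here, so a zero series is sufficient.
for n_years in (588, 988):
    sizes = [overlapping_trends(np.zeros(n_years), L).size
             for L in (10, 20, 30, 40, 50)]
    print(n_years, sizes)
```

A series of n years gives n - L + 1 chunks of length L, so a 588-year series yields 579 ten-year chunks and a 988-year series yields 979. The chunk count grows almost tenfold relative to the non-overlapping case, but neighbouring chunks share nearly all of their data, which is why the number of independent samples does not grow in proportion.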

6.2 Trend significance results

CO2-only signal. The p-values for the near-surface
temperature signal from the TP CO2-only experiment
are given in Table 5 as a function of trend length, season,
statistic, and natural variability noise experiment.
Based on noise levels from either of the natural variability
experiments, linear signal trends for C(t) are significant
at the 5% level or better for all seasons and all
trend lengths ≥20 years. Since we have shown that
C(t) basically provides information on observed
changes in global-mean temperature for the CO2-only
signal (Table 4), our results imply that the most recent
20- to 50-year trends in observed global-mean temperature
are large relative to the variability of 'unforced'
linear trends in HAMCTL and GFDLCTL.

Table 5. Significance (p-values) for seasonal and annual near-surface temperature signals from the TP
experiment with present-day CO2 forcing

                          Trend length (years)
                    10            20            30            40            50
Statistic  Season   HAM   GFDL    HAM   GFDL    HAM   GFDL    HAM   GFDL    HAM   GFDL
R(t)       DJF      0.34  0.43    0.38  0.46    0.09  0.18    0.68  0.67    0.74  0.77
R(t)       MAM      0.38  0.55    0.78  0.75    0.33  0.37    0.35  0.35    0.25  0.23
R(t)       JJA      0.82  0.79    0.38  0.33    0.47  0.39    0.56  0.52    0.22  0.09
R(t)       SON      0.54  0.36    0.49  0.48    0.36  0.34    0.44  0.38    0.89  0.85
R(t)       ANN      0.38  0.33    0.33  0.36    0.06  0.09    0.28  0.20    0.42  0.34
C(t)       DJF      0.01  0.05    0.00  0.00    0.00  0.00    0.00  0.00    0.00  0.00
C(t)       MAM      0.00  0.01    0.00  0.00    0.00  0.00    0.00  0.00    0.00  0.00
C(t)       JJA      0.24  0.35    0.00  0.00    0.00  0.00    0.00  0.00    0.00  0.00
C(t)       SON      0.60  0.57    0.00  0.00    0.00  0.00    0.00  0.00    0.00  0.00
C(t)       ANN      0.03  0.10    0.00  0.00    0.00  0.00    0.00  0.00    0.00  0.00
Sample size (m)     579   979     569   969     559   959     549   949     539   939

The signals are the linear trends, β_S, for the most recent 10, 20, ...,
50 years of the R(t) and C(t) time series shown in Fig. 6 (i.e., the
trends over years 1984-93, 1974-93, ..., 1944-93). To determine
the significance of β_S, we require information on the behavior of
the centered and uncentered pattern correlation statistics in the
absence of external forcing. This information was obtained by
correlating the seasonal and annual temperature-change signals
from the TP CO2-only experiment with temperature anomalies
from the 600-year HAMCTL and 1000-year GFDLCTL integrations,
each of which was performed with a CGCM. The resulting
R_N(t) and C_N(t) time series for each coupled model control run
were then used to generate sampling distributions of 'unforced'
linear trends, β_N, for different seasons and trend lengths. The
p-values were then computed by comparing β_S with the sampling
distribution β_N for the appropriate season, trend length, statistic
type, and coupled model control run (see Sect. 6.1). For each
trend length, the column headed HAM gives the p-values based on
estimates from HAMCTL; results for GFDLCTL are in the column
headed GFDL. Values of 0.05 or less indicate signal
trends significant at the 5% level or better. The
p-values were computed using overlapping 10- to 50-year chunks
of the R_N(t) and C_N(t) time series. The different sample sizes for
the two coupled models are given in the final row

Table 6. As for Table 5, but for seasonal and annual near-surface temperature signals from the TP experiment with combined sulfate
aerosol/CO2 forcing

                          Trend length (years)
                    10            20            30            40            50
Statistic  Season   HAM   GFDL    HAM   GFDL    HAM   GFDL    HAM   GFDL    HAM   GFDL
R(t)       DJF      ?     0.92    0.88  0.82    0.95  0.87    0.75  0.65    0.66  0.56
R(t)       MAM      0.90  0.?5    0.74  0.68    0.62  0.56    0.84  0.78    0.19  0.20
R(t)       JJA      0.76  0.68    0.64  0.?8    0.52  0.46    0.26  0.26    0.03  0.03
R(t)       SON      0.81  0.73    0.61  0.?1    0.50  0.4?    0.27  0.25    0.01  0.01
R(t)       ANN      0.95  0.91    0.79  0.72    0.72  0.64    0.40  0.34    0.08  0.04
C(t)       DJF      0.52  0.51    0.00  0.00    0.00  0.00    0.00  0.00    0.00  0.00
C(t)       MAM      0.12  0.?8    0.00  0.00    0.00  0.00    0.00  0.00    0.00  0.00
C(t)       JJA      0.38  0.43    0.00  0.00    0.00  0.00    0.00  0.00    0.00  0.00
C(t)       SON      0.76  0.69    0.00  0.00    0.00  0.00    0.00  0.00    0.00  0.00
C(t)       ANN      0.4?  0.50    0.00  0.00    0.00  0.00    0.00  0.00    0.00  0.00
Sample size (m)     579   979     569   969     559   959     549   949     539   939

Numerous other studies have demonstrated that the
observed long-term (greater than ≈100 years) changes
in global-mean, annually averaged temperature are
significant. Such studies have used either statistical
models of natural variability (Wigley et al. 1989; Karl
et al. 1991; Bloomfield and Nychka 1992; Allen and
Smith 1994; but see also Woodward and Gray 1993), or
have derived natural variability noise estimates from
one-dimensional upwelling-diffusion models (Wigley
and Raper 1990, 1991a, b) or CGCMs (Stouffer et al.
1994). Our investigation differs from such previous
work in its focus on shorter period (10- to 50-year) signal
trends, seasonal decomposition of the signal, and
the use of noise information from two different
CGCMs.

The only previous study that has considered recent
short-term trends in global-mean annually averaged
temperature is that by Allen et al. (1994), which investigated
the significance of the linear trend in global-mean
low- to mid-tropospheric temperature (sampled
by the satellite-based microwave sounding unit, MSU)
from 1979-1994. Allen et al. (1994) used detrended instrumental
sea-surface temperature (SST) data to estimate
the magnitude of natural variability on time
scales appropriate to the length of their signal, and
concluded that the 15-year trend in low- to mid-tropospheric
temperature was not significant.

Hegerl et al. (1994) used an uncentered pattern similarity
statistic to compare observed and model-predicted
patterns of temperature trends. As in the present
investigation, Hegerl et al. (1994) found that their
(non-optimized) detection statistic was largely a measure
of observed global-mean annually averaged temperature
change for a CO2-only signal. The most recent
20- and 30-year trends in the Hegerl et al. (1994)
detection statistic were highly significant relative to the
variance of linear trends in the first 385 years of
HAMCTL, which agrees with our findings for 20- to
30-year trends in C(t). Hegerl et al. (1994) improved
upon their non-optimized results by rotating their fingerprint
pattern in a direction in which the CO2 signal
could be well-represented and the natural variability
noise was small (see Hasselmann 1979, 1993 for details
of the optimization technique).


In contrast to the C(t) results, none of our R(t) results
for the TP CO2-only signal achieve significance at
the 5% level (Table 5). There is little evidence, therefore,
that the CO2-only spatial pattern of temperature
change (minus the global mean) is steadily evolving in
the observed data.

Combined CO2/sulfate aerosol signal. The trends in
C(t) from the SC experiment are significant in all seasons
for trend lengths from 20 to 50 years (Table 6).
This result is similar to that obtained in the previous
section for the CO2-only signal, and arises largely
through the strong correlation between C(t) and
$\overline{\Delta D}(t)$. It is clear that trends in R(t) also contribute to
the C(t) results, at least in JJA and SON (see Santer et
al. 1995a).

A far more important result is that the R(t) trends
for the SC signal are significant for trend lengths of 50
years in JJA and SON. This result does not depend on
the model (GFDLCTL or HAMCTL) used to define
the natural variability noise. It indicates that, in these
seasons, there is an evolving expression of the SC signal
pattern in the observed data, independent of any
trend in global-mean temperature. The 50-year R(t)
trend for the annually averaged SC signal is marginally
significant for the GFDLCTL noise (p = 0.04) but is
not significant relative to the HAMCTL noise
(p = 0.08). In all seasons, and for both noise integrations,
p-values for 50-year R(t) trends are smaller than
the p-values for 10- to 40-year trend lengths (see Table
6).

Table 7. As for Table 5, but for seasonal and annual near-surface temperature signals from the TP experiment with present-day sulfate
aerosol forcing

                          Trend length (years)
                    10            20            30            40            50
Statistic  Season   HAM   GFDL    HAM   GFDL    HAM   GFDL    HAM   GFDL    HAM   GFDL
R(t)       DJF      0.76  0.72    0.91  0.83    0.98  0.96    0.72  0.63    0.55  0.41
R(t)       MAM      0.96  0.92    0.64  0.61    0.85  0.?3    0.99  0.96    0.49  0.39
R(t)       JJA      0.52  0.??    0.71  0.71    0.69  0.72    0.?4  0.60    0.19  0.24
R(t)       SON      0.63  0.63    0.54  0.51    0.60  0.55    0.37  0.38    0.00  0.01
R(t)       ANN      0.95  0.91    0.78  0.80    0.99  0.97    0.60  0.78    0.18  0.15
C(t)       DJF      0.99  0.97    1.00  1.00    1.00  1.00    1.00  1.00    1.00  1.00
C(t)       MAM      1.00  1.00    1.00  1.00    1.00  1.00    1.00  1.00    1.00  1.00
C(t)       JJA      0.79  0.66    1.00  1.00    1.00  1.00    1.00  1.00    1.00  0.98
C(t)       SON      0.50  0.48    0.99  0.99    1.00  0.98    0.96  0.91    0.59  0.55
C(t)       ANN      0.9?  0.93    1.00  1.00    1.00  1.00    1.00  1.00    1.00  1.00
Sample size (m)     581   979     571   969     561   959     551   949     543   939

For each of the TP response experiments, we have
performed 50 individual significance tests of the observed
R(t) trends, i.e., one test for each season (plus
the annual average), trend length, and natural variability
experiment. A certain fraction of these tests would
be expected to yield significant R(t) trends due to
chance alone (see Wigley and Santer 1990). For the SC
signal, five R(t) results were significant at the 5% level
or better. The probability of obtaining this result due
to chance alone is ca. 10% (this was estimated with the
binomial distribution). Taken together with the result
that none of the R(t) trends for the CO2-only signal
were significant (and with the consistently larger p-values
in C than in SC for 50-year R(t) trends; see Tables
5 and 6), it is highly unlikely that the R(t) results in the
SC and C experiments are due to chance alone.
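
The binomial estimate quoted above can be checked with a short Python calculation. The sketch assumes that the 50 tests are independent and that "this result" means five or more significant outcomes; neither assumption is stated in the text, and the helper name `prob_at_least` is illustrative.

```python
from math import comb

def prob_at_least(k, n, p):
    """Binomial probability of k or more successes in n independent trials."""
    return sum(comb(n, j) * p**j * (1 - p)**(n - j) for j in range(k, n + 1))

# 50 tests at the 5% level; five or more significant by chance alone
chance = prob_at_least(5, 50, 0.05)
print(round(chance, 2))   # close to 0.10
```

Since the tests share data across seasons, trend lengths and noise models, the independence assumption is only a rough approximation.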

Sulfate-only signal. For the linear trends in R(t) from
the TP sulfate-only experiment, only one result
achieves significance at the 5% level or better: the
most recent 50-year trend in SON (see Table 7). Recall
that the 50-year R(t) trend in SON also achieves significance
for the signal from the combined forcing experiment
(Table 6). As noted previously, this suggests that
some portion of the multi-decadal R(t) trend in the latter
case is due to increasing congruence between areas
of predicted and observed cooling.

The C(t) results for the S experiment are generally
the opposite of those for the C experiment (c.f. Tables
5 and 7), i.e., C(t) trends are significantly negative for
most trend lengths and seasons. (Note that differences
in the C versus S significance results for C(t) are predominantly
due to differences in $\overline{\Delta M}$, since
$C(t) \approx \overline{\Delta D}(t) / \overline{\Delta M}$.)

7 Summary and conclusions 

Over the past decade, a substantial modelling effort 
has been directed towards predicting the climate 
changes likely to result from past and future increases 
in CO2 concentrations. An important question is
whether the predictions by such models are consistent 
with the observed surface temperature record over the 
last century. 

The answer to this question depends on how the
searched-for GHG signal is defined. Studies that have
defined a GHG signal in terms of a pattern that incorporates
the global-mean change have shown that this
signal is identifiable with a high level of confidence in
the observed data (Hegerl et al. 1994). Such investigations
have also shown that the global-mean change is
the dominant component of the signal, and that it is
this component that makes the largest contribution to
the overall significance of the results. This complicates
the attribution of observed changes to the specific
cause of GHG forcing, since different combinations of
external forcings and/or internal variability may yield
similar global-mean changes.

Other investigations have attempted to address the
attribution issue by defining a CO2 signal in terms of
spatial anomalies about the global-mean change. The
rationale here is that it is probably difficult to obtain a
high level of correspondence between the sub-global-scale
features of observed and model-predicted
changes by mechanisms other than those used in the
model forcing experiment. To date, studies that have
searched for model-predicted greenhouse warming
patterns in observed data after removal of global-mean
changes have yielded negative or inconclusive results
(Santer et al. 1993). One plausible explanation for this
result may be the neglect of other, non-greenhouse external
forcings in defining the searched-for signal (e.g.,
changes in anthropogenic sulfate aerosols, volcanic
aerosols, and solar output).

Model integrations recently carried out by Taylor
and Penner (1994) represent a first attempt to simulate
explicitly the temperature response to a combination
of CO2 and sulfate aerosol forcing, and formed the basis
of the current study. The seasonally and annually
averaged patterns of near-surface temperature change
were very different in the three response experiments:
a ‘sulfate-only’ experiment with near-present-day sulfur
emissions (S), a ‘CO2-only’ experiment with near-current
atmospheric CO2 concentrations (C), and a
combined forcing experiment with near-present-day
sulfur emissions and CO2 levels (SC). While the integrations
with individual forcing showed either global-scale,
spatially-coherent cooling (S) or warming (C),
the experiment with combined sulfate/CO2 forcing
yielded more complex patterns of temperature change,
characterized by both cooling and warming, with most
of the cooling restricted to the Northern Hemisphere
(see Figs. 1-3).

To determine whether the signal patterns in the S, C
and SC experiments were increasingly evident in the
observed temperature data, we used an observed data
set comprising monthly-mean, land-based surface air
temperatures and sea-surface temperatures (Jones et
al. 1991). For each experiment and season, a single pattern
characterizing the model signal was compared
with a sequence of observed temperature change patterns
for the years 1910-1993. Two different forms of
pattern similarity statistic were computed: R(t), in
which the spatial means of the observed and simulated
fields are subtracted, and C(t), which retains these spatial
means. The two statistics provide different information
in the detection context (see Sect. 4).
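
The practical difference between a centered and an uncentered statistic can be illustrated with a small Python sketch. The exact definitions in Eqs. (5-7) are not reproduced in this part of the paper, so the normalisation used below (a cosine-type similarity with equal weights at every grid point) is an assumption, as are the function names and the made-up five-point patterns. Only the treatment of the spatial means follows the description above.

```python
import numpy as np

def uncentered_stat(obs, model):
    """Pattern similarity with spatial means retained (assumed normalisation)."""
    obs = np.ravel(np.asarray(obs, dtype=float))
    model = np.ravel(np.asarray(model, dtype=float))
    return np.sum(obs * model) / np.sqrt(np.sum(obs**2) * np.sum(model**2))

def centered_stat(obs, model):
    """Same similarity after subtracting each field's spatial mean."""
    obs = np.ravel(np.asarray(obs, dtype=float))
    model = np.ravel(np.asarray(model, dtype=float))
    return uncentered_stat(obs - obs.mean(), model - model.mean())

# Made-up values, for illustration only.
model = np.array([1.5, 0.5, -0.5, 1.0, 2.0])    # model signal pattern
obs = np.array([0.3, 0.1, -0.2, 0.2, 0.4])      # observed change pattern
warmer = obs + 0.5                               # same pattern plus uniform warming

print(centered_stat(obs, model), centered_stat(warmer, model))      # identical
print(uncentered_stat(obs, model), uncentered_stat(warmer, model))  # different
```

Adding a spatially uniform offset to the observed field leaves the centered statistic unchanged but alters the uncentered one, which is why the uncentered statistic responds to changes in the global mean while the centered one isolates the spatial pattern.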

The time series of R(t) and C(t) indicate whether
the model equilibrium signal is becoming increasingly
evident in the observed data. For the CO2-only signal,
none of the R(t) time series show evidence of a positive
trend sustained over at least 4-5 decades. In contrast,
the R(t) time series for the SC signal showed
large, multi-decadal positive trends in JJA and SON.
This indicates that in these seasons there is an evolving
expression of the SC signal pattern in the observed
data, independent of any trend in global-mean temperature.
For the sulfate-only experiment, R(t) has a large
increase over 1940-70 in JJA, SON, and ANN, suggesting
that at least some portion of the long-term
trends in R(t) in JJA and SON for the SC signal is attributable
to areas where cooling occurs. In virtually all
cases, the behavior of C(t) closely parallels changes in
observed global-mean temperature.

To assess the significance of the most recent R(t)
and C(t) trends, we used data from multi-century control
integrations performed with the Hamburg and
GFDL CGCMs (HAMCTL and GFDLCTL, respectively).
The two control runs provide internally-consistent
(but model-specific) estimates of the magnitude
and patterns of surface temperature variability on long
time scales.

For the temperature-change signal from the TP
combined forcing experiment, our results indicated
that the most recent 50-year R(t) trends are significant
in JJA and SON. This result did not depend on whether
HAMCTL or GFDLCTL was used to estimate the
internally generated natural variability. For the signal
from the TP sulfate-only experiment, the 50-year R(t)
trend in SON is the only result that achieves significance
at the 5% level or better. None of the R(t)
trends for the CO2-only signal was significant.

This analysis supports but does not prove the hypo¬ 
thesis that we have detected an anthropogenic climate 
change signal in observed records of near-surface air 
temperature change. We stress that we are relying on 
model noise and model signals in our assessment of 
trend significance. Both have attendant uncertainties. 

Some of the noise uncertainties were illustrated by a
preliminary comparison of the low-frequency variability
of surface temperature in HAMCTL and
GFDLCTL (Appendix B). This variability shows important
differences in the two integrations, in terms of
both global means and patterns. Some of these discrepancies
are due to the different levels of climate drift
in the two integrations (see Fig. B1). The primary conclusions
of our study - the significance of the 50-year
R(t) trends in JJA and SON for the temperature signal
from the SC experiment, and the failure to find significant
R(t) trends for the C experiment - are not affected
by these inter-model differences in variability.
However, this does not guarantee that our results will
remain robust when we use estimates of internally-generated
natural variability from other CGCMs, as these
become available.

Furthermore, although we have tried to look at the
model-dependence of natural variability noise by considering
results from both HAMCTL and GFDLCTL,
we have little or no information about the sensitivity of
the variability in any one model to changes in resolution,
physics, parameterizations, or flux correction
scheme. Both control integrations considered here employed
large flux corrections to compensate for systematic
errors in their atmospheric and oceanic components
(Gates et al. 1993). It is not inconceivable that
such flux adjustment procedures can have an impact on
the simulated multi-decadal to century time scale variability.
Clearly, we urgently require studies that attempt
to validate this variability in CGCM control runs
(Barnett et al. 1995). This is a difficult task (Santer et
al. 1995b), but without such work substantial uncertainties
will remain regarding the reliability of the significance
levels obtained here.

Similar uncertainties affect the signal patterns used
in this study. As has been noted by Taylor and Penner
(1994), the predicted pattern of temperature change in
the SC experiment is clearly a function of the relative
magnitudes of the positive forcing by greenhouse gases
and the negative forcing due to anthropogenic sulfate
aerosols. While the present-day CO2 forcing is relatively
well-known, the current direct sulfate aerosol forcing
is uncertain by a factor of at least two (Wigley and
Raper 1992; Penner et al. 1994b). Furthermore, relative
to other studies (Charlson et al. 1991; Kiehl and
Briegleb 1993), the aerosol forcing in TP is quite large,
and is at the upper end of the range used by Wigley
and Raper (1992). Clearly, these forcing uncertainties
translate into considerable uncertainty regarding the
spatial pattern of the temperature response in the SC
experiment, and in the relative contributions to this response
from CO2 and aerosols. Further signal uncertainties
arise from the lack of a dynamic ocean, the neglect
of the indirect effects of sulfate aerosols, the omission
of the radiative effects of trace gases other than
CO2, and failure to include the effects of carbonaceous
aerosols generated by biomass burning, fossil fuel combustion,
and industrial processes (Penner et al. 1992,
1994b). The optical properties of carbonaceous aerosols are
very different from those of sulfate aerosols.

A further important signal uncertainty is our use of
a model-predicted equilibrium pattern of temperature
change. The use of an equilibrium signal pattern is a
reasonable strategy in ‘CO2-only’ detection studies,
since there is evidence that the equilibrium and transient
temperature responses to CO2 forcing may have
strong pattern similarities (see Santer et al. 1994,
1995a). This similarity is partly due to the relative stability
of the pattern of CO2-induced radiative forcing
as a function of time. For sulfate aerosols, however,
there have been distinct regional changes in the time-evolving
pattern of forcing (e.g., changes related to declining
sulfur emissions in the USA and Western Europe
over the past 20 years and increasing emissions in
China and India; see Karl et al. 1995). Our detection
strategy does not take these changes into account. It is
likely that a more complete explanation of observed
changes may be achieved in detection studies based on
CGCM transient experiments driven by our best estimates
of observed spatio-temporal changes in GHG
and sulfur emissions.

In summary, there are numerous caveats regarding
the signals and natural variability noise that form the
basis of this study. Nevertheless, we have provided first
evidence that both the largest-scale (global-mean) and
smaller-scale (spatial anomalies about the global
mean) components of a combined CO2/anthropogenic
sulfate aerosol signal are identifiable in the observed
near-surface air temperature data. If the coupled-model
noise estimates used here are realistic, an assumption
which thus far has not been rigorously tested on
the crucial decadal to century time scales, we can be
confident that the anthropogenic signal which we have
identified is distinctly different from natural variability
noise. The fact that we have been able to detect the
detailed spatial signature in response to combined CO2
and sulfate aerosol forcing, but not in response to CO2
forcing alone, suggests that some of the regional-scale
background noise (against which we were trying to detect
a CO2-only signal) is in fact part of the signal of a
sulfate aerosol effect on climate.

While our confidence in the identification of an anthropogenic
effect on climate is high, we have not
shown conclusively that the signal identified can be attributed
to the unique cause of anthropogenic sulfate
aerosols and CO2. We have taken a first step in the
direction of attribution by showing that agreement between
modelled and observed changes exists at relatively
small spatial scales. Enhancing our confidence in
the attribution of observed climate changes to anthropogenic
causes will require the elimination of other
forcing mechanisms (such as solar variability and volcanoes)
as possible explanations for the observed
changes. This should provide strong motivation for future
modelling studies.

Appendix A: sensitivity to choice of reference period

k is useful to consider how the choice of to. the central 
year of the p -year period used for computing the refer¬ 
ence state 2^(x), influences the R(i) and C(r) Tesults 
for the near-surface temperature signals from the three 
TP response experiments. Sensitivity of trends in R (r) 


Ooserved coverage as a function of time 



1900 1920 1940 1960 1980 2000 

Time (years) 

Available da}# .-. Subset for l t ^ 1906 ----- Subset for = 1954 


Fig. A1. Observed changes in coverage of gridded, annually
averaged near-surface temperature data as a function of time. The
curve "available data" shows the change in coverage for the
filtered Jones et al. (1991) data if the data are used "as is" (i.e., as
anomalies relative to 1950-79). The curves "t_0 = 1906" and
"t_0 = 1954" show the coverage changes which are obtained by
requiring that the data coverage in D(x, t) is a subset of the
coverage in D_0(x) (see Appendix A). For the earlier reference period,
the coverage fluctuates around ca. 925 grid-points from ca. 1910-1985,
with minima of ca. 700 and 750 grid-points in the war years
due to a reduction in oceanic data availability. The coverage is
less stable with time for t_0 = 1954, since hundreds of grid-points
that had data in the reference period 19*6-60 did not have data
in the first few decades of the century. Note, however, that the
average coverage from ca. 1950-90 (roughly 1550 grid-points) is
much higher than for t_0 = 1906. In all three curves, recent
reductions in coverage reflect the degradation of the station observing
network over land.
Fig. A2. Spatial coverage of annual mean near-surface temperature
data for two different reference periods (t_0 = 1906 and
t_0 = 1954;



and C(t) to the choice of reference state would make
interpretation of the results more difficult.

In the ideal situation, in which there are no missing
values in the observed data set, it can be shown analytically
that the changes in C(t) as a function of time are
independent of the choice of t_0. The same holds true
for R(t) if the observed spatial variance, s_D^2(t), is
roughly constant with time (as it is for temperature
data: see Santer et al. 1993).

However, the observed data do not comply with this
ideal scenario, and show large, spatially non-random
changes in coverage as a function of time. This is
illustrated in Figs. A1 and A2. The maximum spatial
coverage of observed data is determined by the coverage
over the reference period: the subset of data in D(x, t)
that is actually used must be equal to or a subset of the
coverage of the reference state D_0(x) in order to
calculate changes relative to D_0(x). Changes in coverage
with time therefore mean that the data employed in
calculating C(t) and R(t) will depend on the choice of
t_0, so results may be sensitive to t_0.
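
The subset requirement described above can be illustrated with a small sketch. It assumes, purely for illustration, that a grid-point belongs to the reference state only if it has data in every year of the p-year window centred on t_0; the function names `reference_mask` and `coverage` and the toy values are hypothetical and are not taken from the paper.

```python
from typing import Dict, List, Optional

Field = List[Optional[float]]  # one value per grid-point; None marks missing data


def reference_mask(data: Dict[int, Field], t0: int, p: int) -> List[bool]:
    """True where a grid-point has data in every year of the p-year window centred on t0.

    Assumption: 'coverage of the reference state' means complete data in the window.
    """
    half = p // 2
    years = range(t0 - half, t0 + half + 1)
    n_points = len(data[t0])
    return [all(data[y][x] is not None for y in years) for x in range(n_points)]


def coverage(data: Dict[int, Field], mask: List[bool]) -> Dict[int, int]:
    """Usable grid-points per year: present in that year AND inside the reference mask."""
    return {
        year: sum(1 for value, ok in zip(field, mask) if ok and value is not None)
        for year, field in data.items()
    }


# Toy 4-point grid with made-up values (not observations).
toy = {
    1905: [1.0, None, None, 0.2],
    1906: [1.1, None, None, 0.3],
    1907: [0.9, None, 0.1, 0.1],
    1953: [1.2, 0.5, 0.4, 0.0],
    1954: [1.3, 0.6, 0.5, 0.1],
    1955: [1.1, 0.4, 0.3, None],
}
early = coverage(toy, reference_mask(toy, t0=1906, p=3))
late = coverage(toy, reference_mask(toy, t0=1954, p=3))
```

In this toy case the later reference period gives higher coverage in the 1950s but lower and less stable coverage early in the record, which is the qualitative trade-off discussed in the text.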

Furthermore, it may be difficult to determine
whether a positive trend in C(t) or R(t) reflects a true
time-increasing pattern similarity between the model
and observed fields or the effect of coverage changes.
In choosing t_0, it is therefore desirable to minimize
coverage changes with time, and yet still maintain
adequate coverage.
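
As a rough illustration of the difference between the two statistics, the sketch below computes a centered and an uncentered pattern correlation over the grid-points that both fields share. The normalisation used here (the standard correlation and cosine-similarity forms, with no area weighting) is an assumption made for the example; the exact definitions of R(t) and C(t) are given elsewhere in the paper and may differ. All names and values are hypothetical.

```python
import math
from typing import List, Optional, Tuple

Field = List[Optional[float]]  # None marks a grid-point without data


def paired(obs: Field, model: Field) -> List[Tuple[float, float]]:
    """Keep only grid-points where both fields have a value."""
    return [(o, m) for o, m in zip(obs, model) if o is not None and m is not None]


def centered_correlation(obs: Field, model: Field) -> float:
    """Pattern correlation after subtracting each field's spatial mean."""
    pairs = paired(obs, model)
    o_mean = sum(o for o, _ in pairs) / len(pairs)
    m_mean = sum(m for _, m in pairs) / len(pairs)
    num = sum((o - o_mean) * (m - m_mean) for o, m in pairs)
    den = math.sqrt(sum((o - o_mean) ** 2 for o, _ in pairs)
                    * sum((m - m_mean) ** 2 for _, m in pairs))
    return num / den


def uncentered_correlation(obs: Field, model: Field) -> float:
    """Same statistic without subtracting the spatial means."""
    pairs = paired(obs, model)
    num = sum(o * m for o, m in pairs)
    den = math.sqrt(sum(o * o for o, _ in pairs) * sum(m * m for _, m in pairs))
    return num / den


# Toy fields: identical spatial pattern, but the model field has a uniform offset.
obs_change = [1.0, 2.0, None, 3.0]
model_change = [11.0, 12.0, 5.0, 13.0]
r_value = centered_correlation(obs_change, model_change)
c_value = uncentered_correlation(obs_change, model_change)
```

The centered statistic ignores the uniform offset, while the uncentered one responds to it, which is why the uncentered form carries global-mean information in addition to pattern information.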

The effect of different choices of t_0 and attendant
differences in coverage is illustrated in Figs. A3 and A4


[Figure A3 plot: R(t), sensitivity to observed reference period. Horizontal axis: time (years). Curves: t_0 = 1906 and t_0 = 1954.]

Fig. A3. Centered pattern correlation, R(t), between observed
seasonally and annually averaged near-surface temperature
changes and changes in the TP response experiment with combined
sulfate aerosol/CO2 forcing. Results are for two different
choices of reference year: t_0 = 1906 and t_0 = 1954. Changes in R(t)
with time are relatively insensitive to the choice of reference
year.


[Figure A4 plot: C(t), sensitivity to observed reference period.]



Fig. A4. As for Fig. A3, but for the uncentered pattern
correlation C(t), without subtraction of model and observed spatial
means.
for R(t) and C(t), respectively. We consider the
extreme cases t_0 = 1906 and t_0 = 1954. Results are for the
near-surface temperature signals in the TP experiment
with combined CO2/sulfate aerosol forcing. For both
statistics, it is clear that their changes in time (i.e., ∂R/∂t
and ∂C/∂t) are, to first order, independent of t_0.

These results have several important implications. 
First, it is apparent that linear trends in the pattern 
correlation time series are relatively insensitive to dif¬ 
ferences in observed data coverage for the two choices 
of t_0. Second, we note that the absolute values of both
statistics are somewhat arbitrary, and can be raised or 
lowered by selection of a different reference state. This 
is not a problem here, since our interest is in trends 
rather than absolute values. 

Since our focus is on trends over the last 10 to 50
years of the pattern correlation time series, we use
t_0 = 1954. This guarantees that we have relatively stable
observed data coverage over 1944-1995, and considerably
higher coverage than for t_0 = 1906 (see Figs. A1
and A2). The choice of a later t_0 would improve coverage
still further, but have the undesirable effect of
yielding large coverage changes in the 1950s (due to
the introduction of data from Antarctica).


Appendix B: comparison of variability in HAMCTL 
and GFDLCTL 

Features of the variability of near-surface air temperature
in the first 100-385 years of HAMCTL have been
described by Cubasch et al. (1992, 1994), Santer et al.
(1994, 1995b) and Hegerl et al. (1994). HAMCTL exhibits
substantial variability on decadal-to-century time
scales (see Fig. B1a). However, part of this variability,
particularly over the first 200 years of the integration, 
may be a model artifact and attributable to a form of 
climate drift. 

The variability in GFDLCTL (Fig. B1b) has been
analyzed by Delworth et al. (1993) for the thermohaline
circulation and by Stouffer et al. (1994), Mehta and
Delworth (1995), and Manabe and Stouffer (1995) for
near-surface air temperature and SST. These investigations
have attempted to validate the interannual to
interdecadal time scale variability simulated by the
GFDL CGCM. The study by Stouffer et al. (1994)
showed qualitative agreement between observed and
model-simulated patterns of interannual near-surface
temperature variability (with the exception of the tropical
Pacific, where both GFDLCTL and HAMCTL underestimate
the variability associated with ENSO phenomena).
It also indicated that modeled and observed
relationships between global-mean changes and regional
temperature changes show a broadly similar
structure. Mehta and Delworth (1995) found considerable
similarity between simulated and observed patterns
characterizing decadal time scale variability of
SST patterns in the tropical Atlantic. Such studies help
to build confidence in the reliability of the model
variability on interannual to decadal time scales. Validation
of model variability on longer time scales is more
problematical.

The behavior of global-mean, annually averaged
near-surface temperature is non-stationary in both
HAMCTL and GFDLCTL, despite the application of
flux adjustment schemes. The linear trend in
GFDLCTL (0.023°C/century) is relatively monotonic,
while the much larger trend in HAMCTL (0.102°C/century)
is primarily due to a large negative excursion
of temperature over the first two hundred years of the
integration. The standard deviations of temperature
for the detrended data differ by only 0.02°C (0.10°C
for GFDLCTL versus 0.12°C for HAMCTL). For the
observations over the period 1861-1993, the corresponding
figure is 0.21°C if no overall trend is removed.
If an EBM is used to remove a combined
GHG/sulfate aerosol effect from the observed data in
an optimal way (cf. Wigley and Raper 1991b), the





Fig. B1a, b. Time series of global-mean, annually averaged
near-surface temperature changes a in the 600-year HAMCTL and b
1000-year GFDLCTL integrations. Both control runs were performed
with CGCMs. Anomalies were defined relative to the
mean state over the entire integration. The unfiltered and
low-pass filtered time series are shown, together with the overall
least-squares linear trends. Both time series are non-stationary;
the linear trend per century is roughly four times larger in
HAMCTL than in GFDLCTL (0.102°C versus 0.023°C,
respectively).
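
The trend and detrended-variability figures quoted for the control runs rest on an ordinary least-squares fit to an annual series. A minimal sketch of that calculation is given below, assuming one value per year and a sample standard deviation with n - 1 in the denominator; the synthetic series and the function names are hypothetical and are not model output.

```python
import math
from typing import List, Tuple


def linear_trend(series: List[float]) -> Tuple[float, float]:
    """Least-squares slope (per time step) and intercept for an evenly spaced series."""
    n = len(series)
    t_mean = (n - 1) / 2.0
    y_mean = sum(series) / n
    sxx = sum((t - t_mean) ** 2 for t in range(n))
    sxy = sum((t - t_mean) * (y - y_mean) for t, y in enumerate(series))
    slope = sxy / sxx
    return slope, y_mean - slope * t_mean


def detrended_std(series: List[float]) -> float:
    """Standard deviation of the residuals about the least-squares line."""
    slope, intercept = linear_trend(series)
    residuals = [y - (intercept + slope * t) for t, y in enumerate(series)]
    return math.sqrt(sum(r * r for r in residuals) / (len(series) - 1))


# Synthetic 600-year series: a 0.1 per century drift plus a +/-0.1 alternation.
synthetic = [0.001 * t + (0.1 if t % 2 == 0 else -0.1) for t in range(600)]
trend_per_century = 100.0 * linear_trend(synthetic)[0]
sigma_detrended = detrended_std(synthetic)
```

Note that removing a single overall linear trend, as here, does not remove a drift confined to part of the record; the residual standard deviation would still carry that drift.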
[Figure B2 plot: spectral density for GFDL, ECHAM and UD-EBM control runs. Horizontal axis: period (years), 100 to 1.]

Fig. B2. Power spectra of global-mean, annually averaged,
near-surface temperature changes in the 600-year HAMCTL and
1000-year GFDLCTL integrations, and in a 5000-year integration
performed with an upwelling-diffusion EBM (Wigley and Raper
1991b). Spectra were computed by taking the Fourier transform
of the autocorrelation function, with the maximum number of
lags equal to i the length of the time series (see Jenkins and
Watts 1968). Spectra were smoothed using a Tukey window, with
widths of 60, 100, and 200 lags (for HAMCTL, GFDLCTL, and the
EBM, respectively). The number of frequencies computed is twice
the window width (for HAMCTL and GFDLCTL) and four times the
window width for the EBM. The thin lines are the 95% confidence
intervals for the coupled model spectral estimates. HAMCTL has
higher power than GFDLCTL for periods longer than roughly
60 years; the reverse is true for periods shorter than 60 years.
The EBM was run with random forcing and a climate sensitivity of
3.0°C for a doubling of CO2. The bar in the upper left corner of
the figure gives the EBM spectral densities for climate
sensitivities of 1.5°C and 4.5°C at a period of 400 years. The
coupled model spectra were computed after removal of overall
linear trends in the data.


standard deviation of the residuals is 0.10°C, a result
that is robust to uncertainties in the assumed anthropogenic
forcing history.

Power spectra of the detrended global-mean, annually
averaged near-surface temperature changes for
HAMCTL and GFDLCTL indicate that GFDLCTL
has more power than HAMCTL on time scales of
roughly 2-60 years, and less power than HAMCTL for
periods greater than ~60 years (see Fig. B2). These
results are relatively insensitive to the choice of window
width for smoothing. Part of the explanation for
the difference in power at the low-frequency end of the
coupled model spectra is the large negative temperature
excursion over the first 200 years of HAMCTL.
Removal of the overall linear trend reduces, but does
not remove, the effect of this initial drift on the estimated
spectrum. The areas defined by the upper and
lower 95% confidence limits for each of the coupled
model spectra overlap at all frequencies, with minimum
overlap at the high- and low-frequency ends of
the spectra.

Figure B2 also shows the spectrum from a 5000-year
integration of an upwelling-diffusion energy-balance
model (EBM; Wigley and Raper 1990; 1991b) with
random forcing and a climate sensitivity, ΔT_2×CO2, of
3.0°C for a doubling of atmospheric CO2. As noted by
Hasselmann (1976) and Wigley and Raper (1990), the
low-frequency spectrum depends on ΔT_2×CO2. The
GFDL AGCM (coupled with a mixed-layer ocean) has
a sensitivity of 3.5°C (Stouffer, personal communication),
while the corresponding figure is 2.6°C for the
ECHAM AGCM (see Table B1 in Gates et al. 1992).
The EBM value was chosen to fall roughly between
the Hamburg and GFDL sensitivities. At high frequencies,
the EBM was tuned to agree with the observed
data after factoring out the effect of ENSO (which
accounts for about 30% of the variance on time scales
less than 10 years). Note that in the lower frequency
range, HAMCTL has higher power than GFDLCTL
despite having a lower value of ΔT_2×CO2, a result
opposite to that expected on theoretical grounds, as the
EBM result on the left-hand side of Fig. B2 indicates.
This is at least partially due to the above-mentioned
climate drift in the first 200 years of HAMCTL.

In contrast to the HAMCTL and EBM results, the
spectrum for GFDLCTL flattens out at periods greater
than 100 years. The EBM also shows a flattening
of the spectrum, but at frequencies substantially
[Figures B3 and B4 plots. Panel titles: signal-to-noise ratios for C(t) trends; signal-to-noise ratios for R(t) trends. Horizontal axes: trend length (years). Curves: GFDL and ECHAM noise estimates.]


Fig. B3. Signal-to-noise ratios, SN_i, for observed trends in the
uncentered pattern correlation statistic C(t). The signals, \beta_s(i), are
the observed trends in C(t) for the last 10 to 50 years. The noise,
\varepsilon_N(i), is the standard error of the sampling distribution of \beta_N(i)
(see Sect. 6.1 and Appendix B for further details). Standard errors
were computed using data from both GFDLCTL and
HAMCTL. Results are for annually averaged near-surface temperature
data. For temperature-change signals from the Taylor
and Penner sulfate-only and CO2-only experiments, SN_i is consistently
lower for the GFDLCTL noise estimates. The GFDLCTL
versus HAMCTL differences in SN_i can be interpreted in terms
of the coupled-model differences in the spectral density of
global-mean, annually averaged temperature on time scales of 10 to 50
years (see Fig. B2). The thin solid line gives the 5% significance
threshold for an assumed Gaussian distribution of trends.

Fig. B4. As for Fig. B3, but for observed trends in the centered
pattern correlation statistic R(t) and for seasonally averaged data
(JJA). SN_i ratios are very similar for the two noise estimates.
This result implies some similarity in the HAMCTL and
GFDLCTL patterns of near-surface temperature variability over
the observed data window on 10- to 50-year time scales.


lower than in the GFDL model simulation (and
beyond the lowest frequency shown in Figure B2). The
ubiquity of substantial power at century time scales in
paleoclimate spectra (see, e.g., Crowley and North
1991; Stocker and Mysak 1992) suggests that either the
paleodata have some low-frequency forcing that the
model is lacking (e.g., solar variability, volcanoes) or
that the model is underestimating the magnitude of
century-time scale variability associated with internal
oscillations of the coupled atmosphere-ocean system
(see Barnett et al. 1995). A further possibility is that
the paleodata may contain spurious low-frequency
variability.
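
The spectra discussed here are described as Fourier transforms of the autocorrelation function, smoothed with a Tukey window. A minimal sketch of that kind of lag-window estimate follows. It assumes the Tukey lag window w(k) = 0.5 (1 + cos(pi k / M)), a biased autocovariance estimator, and frequencies evenly spaced from 0 to 0.5 cycles per year; the window width, the synthetic series and all names are illustrative choices and are not the settings used for Fig. B2.

```python
import math
from typing import List


def autocovariance(x: List[float], max_lag: int) -> List[float]:
    """Biased autocovariance estimates for lags 0..max_lag."""
    n = len(x)
    mean = sum(x) / n
    d = [v - mean for v in x]
    return [sum(d[t] * d[t + k] for t in range(n - k)) / n for k in range(max_lag + 1)]


def tukey_lag_window(k: int, width: int) -> float:
    """Tukey lag window; equals 1 at lag 0 and 0 at lag == width."""
    return 0.5 * (1.0 + math.cos(math.pi * k / width))


def lag_window_spectrum(x: List[float], width: int, freqs: List[float]) -> List[float]:
    """Smoothed spectral estimate: cosine transform of the windowed autocovariance."""
    c = autocovariance(x, width)
    spectrum = []
    for f in freqs:
        s = c[0] + 2.0 * sum(
            tukey_lag_window(k, width) * c[k] * math.cos(2.0 * math.pi * f * k)
            for k in range(1, width)
        )
        spectrum.append(2.0 * s)  # one-sided density, frequency in cycles per time step
    return spectrum


# Synthetic annual series with a pure 10-year cycle (not model output).
series = [math.sin(2.0 * math.pi * t / 10.0) for t in range(400)]
width = 40
freqs = [0.5 * j / (2 * width) for j in range(2 * width + 1)]  # twice the window width
spectrum = lag_window_spectrum(series, width, freqs)
peak_freq = freqs[spectrum.index(max(spectrum))]
```

A wider window gives finer frequency resolution at the cost of noisier estimates, which is why the text checks that the results are insensitive to the choice of window width.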

Since C(t) closely parallels the ratio between model
and observed global-mean temperature changes, and
since the standard error for a linear trend of length L_i
is related to the spectral power at frequency 1/L_i, one
might expect the significance of C(t) values in Tables
5-7 to differ noticeably depending on whether
HAMCTL or GFDLCTL was used to estimate the
noise. This is difficult to determine on the basis of
p-values, since these are generally highly significant for
observed C(t) trends (Tables 5-7). We therefore computed
signal-to-noise ratios for C(t) trends of different
length. As in Santer et al. (1995b), we define the
signal-to-noise ratio SN_i for trend length i × 10 as

SN_i = \beta_s(i) / \varepsilon_N(i)    (10)


where \beta_s(i) is the observed trend in C(t) and \varepsilon_N(i) is
the standard error of the sampling distribution of
\beta_N(i), computed using either the GFDLCTL or
HAMCTL data (see Sect. 6.1 for full details). The
standard error is simply

/Mi c) 7 (11) 

where m_i is the sample size (the number of linear
trends of length i × 10 in the C_N(t) natural variability
time series).
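
The signal-to-noise ratio defined above can be sketched as follows. Two assumptions are made for the example and are not taken from this section: the sampling distribution is built from overlapping chunks of the noise series, moved one step at a time, and the standard error is taken to be the sample standard deviation of the chunk trends. The noise series is synthetic and the function names are hypothetical.

```python
import math
from typing import List, Tuple


def ols_slope(y: List[float]) -> float:
    """Least-squares slope of an evenly spaced series."""
    n = len(y)
    t_mean = (n - 1) / 2.0
    y_mean = sum(y) / n
    sxx = sum((t - t_mean) ** 2 for t in range(n))
    return sum((t - t_mean) * (v - y_mean) for t, v in enumerate(y)) / sxx


def noise_trends(noise: List[float], length: int) -> List[float]:
    """Trends of a given length fitted to overlapping chunks of the noise series."""
    return [ols_slope(noise[s:s + length]) for s in range(len(noise) - length + 1)]


def signal_to_noise(observed_trend: float, noise: List[float], length: int) -> Tuple[float, int]:
    """Return (observed trend / standard error of the noise trends, sample size)."""
    trends = noise_trends(noise, length)
    m = len(trends)
    mean = sum(trends) / m
    std_err = math.sqrt(sum((b - mean) ** 2 for b in trends) / (m - 1))
    return observed_trend / std_err, m


# Synthetic stand-in for a natural variability time series (not model output).
noise_series = [math.sin(0.7 * t) + 0.5 * math.sin(0.13 * t) for t in range(300)]
sn_20, m_20 = signal_to_noise(0.05, noise_series, length=20)
sn_20_double, _ = signal_to_noise(0.10, noise_series, length=20)
```

Because the ratio is linear in the observed trend, differences between noise estimates from two control runs show up directly as a rescaling of the ratio for a given trend length.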

Figure B3 shows SN_i for 10- to 50-year linear trends
in C(t). Results are for annually averaged signal and
noise data. For temperature-change signals from the
sulfate-only and CO2-only experiments, the higher
GFDLCTL variance on 10- to 50-year time scales (Fig.
B2) yields consistently lower signal-to-noise ratios than
for HAMCTL. For trends ≥ 20 years, SN_i is much
larger than the 5% significance threshold (for an assumed
Gaussian distribution), so that GFDLCTL versus
HAMCTL variance differences do not influence
decisions on the significance of observed C(t) trends.
As discussed previously, in the specific case of the SC
experiment C(t) provides a mix of pattern- and
global-mean change information. In this instance, therefore,
differences in SN_i cannot be interpreted solely in terms
of the different behavior of GFDLCTL and HAMCTL
global-mean temperature spectra in Fig. B2.
Fig. B5a-d. Empirical orthogonal functions (EOFs) 1 and 2 of the
GFDLCTL and HAMCTL annually averaged near-surface temperature
anomaly data. EOFs were computed using data from
the first 400 years of GFDLCTL and the last 400 years of
HAMCTL (in order to exclude the large negative temperature
excursion during the first 200 years of HAMCTL; see Fig. B1a).
Anomalies were computed relative to the 400-year means of each
control run, and were filtered as described in Sect. 4. To facilitate
comparison with the observed R(t) results, model data were
interpolated to the observed grid, and grid-points outside the
observed data mask for t_0 = 1954 (see Fig. A2) were not used for
computing EOFs. GFDLCTL (HAMCTL) EOFs 1 and 2 explain
13.2% and 8.9% (11.0% and 9.9%) of the total space-time
variance, respectively. There are some large-scale similarities in the
EOF patterns: both coupled models have large loadings of the
same sign over high-latitude land areas of North America and
Eurasia.
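
For readers unfamiliar with EOFs, the sketch below shows one way to obtain the leading patterns and the fraction of total variance each explains. It substitutes power iteration with deflation on the spatial covariance matrix for a full eigen-decomposition, uses no area weighting or filtering, and runs on a synthetic four-point field; these are simplifications for illustration and do not reproduce the procedure used for the control runs. All names are hypothetical.

```python
import math
from typing import List, Tuple


def leading_eofs(samples: List[List[float]], n_modes: int = 2,
                 iterations: int = 200) -> Tuple[List[List[float]], List[float]]:
    """Return (unit-length EOF patterns, fraction of total variance explained by each)."""
    nt, nx = len(samples), len(samples[0])
    means = [sum(row[j] for row in samples) / nt for j in range(nx)]
    anom = [[row[j] - means[j] for j in range(nx)] for row in samples]
    cov = [[sum(a[i] * a[j] for a in anom) / (nt - 1) for j in range(nx)]
           for i in range(nx)]
    total_variance = sum(cov[i][i] for i in range(nx))
    patterns, fractions = [], []
    for _ in range(n_modes):
        v = [1.0 / (j + 1) for j in range(nx)]  # arbitrary deterministic start vector
        for _ in range(iterations):
            w = [sum(cov[i][j] * v[j] for j in range(nx)) for i in range(nx)]
            norm = math.sqrt(sum(c * c for c in w))
            v = [c / norm for c in w]
        eigenvalue = sum(v[i] * sum(cov[i][j] * v[j] for j in range(nx))
                         for i in range(nx))
        patterns.append(v)
        fractions.append(eigenvalue / total_variance)
        # Deflate: remove the mode just found before searching for the next one.
        cov = [[cov[i][j] - eigenvalue * v[i] * v[j] for j in range(nx)]
               for i in range(nx)]
    return patterns, fractions


# Synthetic field built from two orthogonal spatial patterns (not model output).
p1 = [0.5, 0.5, 0.5, 0.5]
p2 = [0.5, -0.5, 0.5, -0.5]
field = [[3.0 * math.sin(0.3 * t) * a + math.cos(1.1 * t) * b for a, b in zip(p1, p2)]
         for t in range(200)]
eofs, explained = leading_eofs(field)
overlap_with_p1 = abs(sum(e * p for e, p in zip(eofs[0], p1)))
```

In this toy case two modes account for all of the variance; for the control runs discussed here, the leading pair explains only a modest share, so many more modes are needed.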

Signal-to-noise ratios for 10- to 50-year linear trends
in R(t) are surprisingly similar, at least for JJA (Fig.
B4). Note that the p-values for R(t) trends (Tables 5-7)
are also similar for GFDLCTL and HAMCTL noise
estimates. This suggests that the patterns of near-surface
temperature variability on 10- to 50-year time
scales must show some degree of correspondence in
GFDLCTL and HAMCTL. To compare the dominant
patterns of variability, we computed empirical orthogonal
functions (EOFs) of the HAMCTL and
GFDLCTL annually averaged near-surface temperature
data (using the first 400 years of GFDLCTL and
the last 400 years of HAMCTL). Model anomaly data
were filtered as described in Sect. 4, and grid-points
outside the observed data mask for t_0 = 1954 were
excluded (see Fig. A2). This procedure allows us to
compare model variability modes over the same data
window used for computing observed R(t) trends.

The partitioning of total space-time variance is similar
in both control integrations. EOFs 1 and 2 of
GFDLCTL explain 13.2% and 8.9% of the variance
(cf. 11.0% and 9.9% for HAMCTL). The total number
of EOFs required to explain ≥95% of the variance is
69 for GFDLCTL and 63 for HAMCTL. The EOF patterns
show some large-scale spatial similarity: both
coupled models have large loadings of the same sign
over high-latitude land areas of North America and
Eurasia. There are also pronounced differences. EOF
2 of GFDLCTL has the same sign at virtually all
grid-points, while such global-scale coherence is absent in
the HAMCTL EOFs. The marked dipole structure
over North America in the first pair of HAMCTL
EOFs has no analogue in the dominant GFDLCTL
modes.

A very different result is obtained if the full global
fields are used for computing EOFs. In this case,
differences in the behavior of sea-ice in the two integrations
yield a much flatter eigenvalue spectrum for
GFDLCTL than for HAMCTL. Further work is required
to identify common patterns of variability in the